Warning: Permanently added '2620:52:3:1:dead:beef:cafe:c1d7' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/9393549-fedora-rawhide-x86_64 --chroot fedora-rawhide-x86_64 Version: 1.3 PID: 3410 Logging PID: 3411 Task: {'allow_user_ssh': False, 'appstream': False, 'background': False, 'build_id': 9393549, 'buildroot_pkgs': [], 'chroot': 'fedora-rawhide-x86_64', 'enable_net': False, 'fedora_review': False, 'git_hash': '00fd1b33ac0c86ebe1f8b27ec802a3a7c7826c07', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/rccl', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'rccl', 'package_version': '6.4.2-3', 'project_dirname': 'RH', 'project_name': 'RH', 'project_owner': '@rocm-packagers-sig', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/@rocm-packagers-sig/RH/fedora-rawhide-x86_64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}], 'sandbox': '@rocm-packagers-sig/RH--trix', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': 0, 'submitter': 'trix', 'tags': [], 'task_id': '9393549-fedora-rawhide-x86_64', 'timeout': 180000, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/rccl /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/rccl', '/var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl'... Running: git checkout 00fd1b33ac0c86ebe1f8b27ec802a3a7c7826c07 -- cmd: ['git', 'checkout', '00fd1b33ac0c86ebe1f8b27ec802a3a7c7826c07', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl rc: 0 stdout: stderr: Note: switching to '00fd1b33ac0c86ebe1f8b27ec802a3a7c7826c07'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at 00fd1b3 automatic import of rccl Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources INFO: Downloading RCCL-6.4.2.tar.gz INFO: Reading stdout from command: curl --help all tail: /var/lib/copr-rpmbuild/main.log: file truncated INFO: Calling: curl -H Pragma: -o RCCL-6.4.2.tar.gz --location --connect-timeout 60 --retry 3 --retry-delay 10 --remote-time --show-error --fail --retry-all-errors https://copr-dist-git.fedorainfracloud.org/repo/pkgs/@rocm-packagers-sig/RH/rccl/RCCL-6.4.2.tar.gz/md5/5323c56546d4e3634f04898820e8816c/RCCL-6.4.2.tar.gz % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 1851k 100 1851k 0 0 11.1M 0 --:--:-- --:--:-- --:--:-- 11.2M INFO: Reading stdout from command: md5sum RCCL-6.4.2.tar.gz Running (timeout=180000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1754764510.102582 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 6.3 starting (python version = 3.13.5, NVR = mock-6.3-1.fc42), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1754764510.102582 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl/rccl.spec) Config(fedora-rawhide-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.3 INFO: Mock Version: 6.3 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1754764510.102582/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using container image: registry.fedoraproject.org/fedora:rawhide INFO: Pulling image: registry.fedoraproject.org/fedora:rawhide INFO: Tagging container image as mock-bootstrap-fddf8eab-f334-4ad2-8e7c-7efa6861a97b INFO: Checking that f52e15e602dbbc3292e0a4199d7da0a5035963e197a3f242e648a7ba232a1abc image matches host's architecture INFO: Copy content of container f52e15e602dbbc3292e0a4199d7da0a5035963e197a3f242e648a7ba232a1abc to /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1754764510.102582/root INFO: mounting f52e15e602dbbc3292e0a4199d7da0a5035963e197a3f242e648a7ba232a1abc with podman image mount INFO: image f52e15e602dbbc3292e0a4199d7da0a5035963e197a3f242e648a7ba232a1abc as /var/lib/containers/storage/overlay/21c8c0c1f56ab3d34cc60793f568273a367bcda592e2981045ac8a90fe0e369b/merged INFO: umounting image f52e15e602dbbc3292e0a4199d7da0a5035963e197a3f242e648a7ba232a1abc (/var/lib/containers/storage/overlay/21c8c0c1f56ab3d34cc60793f568273a367bcda592e2981045ac8a90fe0e369b/merged) with podman image umount INFO: Removing image mock-bootstrap-fddf8eab-f334-4ad2-8e7c-7efa6861a97b INFO: Package manager dnf5 detected and used (fallback) INFO: Not updating bootstrap chroot, bootstrap_image_ready=True Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1754764510.102582/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf5 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-5.99.91-3.fc43.x86_64 rpm-sequoia-1.9.0-2.fc43.x86_64 dnf5-5.2.16.0-1.fc43.x86_64 dnf5-plugins-5.2.16.0-1.fc43.x86_64 Start: installing minimal buildroot with dnf5 Updating and loading repositories: Copr repository 100% | 1.4 MiB/s | 214.7 KiB | 00m00s fedora 100% | 13.3 MiB/s | 22.1 MiB | 00m02s Repositories loaded. Package Arch Version Repository Size Installing group/module packages: bash x86_64 5.3.0-2.fc43 fedora 8.4 MiB bzip2 x86_64 1.0.8-21.fc43 fedora 95.3 KiB coreutils x86_64 9.7-5.fc43 fedora 5.4 MiB cpio x86_64 2.15-6.fc43 fedora 1.1 MiB diffutils x86_64 3.12-3.fc43 fedora 1.6 MiB fedora-release-common noarch 43-0.19 fedora 20.4 KiB findutils x86_64 1:4.10.0-6.fc43 fedora 1.8 MiB gawk x86_64 5.3.2-2.fc43 fedora 1.8 MiB glibc-minimal-langpack x86_64 2.42-2.fc43 fedora 0.0 B grep x86_64 3.12-2.fc43 fedora 1.0 MiB gzip x86_64 1.13-4.fc43 fedora 388.8 KiB info x86_64 7.2-6.fc43 fedora 353.9 KiB patch x86_64 2.8-2.fc43 fedora 222.8 KiB redhat-rpm-config noarch 343-10.fc43 fedora 182.9 KiB rpm-build x86_64 5.99.91-3.fc43 fedora 285.5 KiB sed x86_64 4.9-5.fc43 fedora 857.3 KiB shadow-utils x86_64 2:4.18.0-3.fc43 fedora 3.9 MiB tar x86_64 2:1.35-6.fc43 fedora 2.9 MiB unzip x86_64 6.0-67.fc43 fedora 386.3 KiB util-linux x86_64 2.41.1-15.fc43 fedora 3.5 MiB which x86_64 2.23-3.fc43 fedora 83.5 KiB xz x86_64 1:5.8.1-2.fc43 fedora 1.3 MiB Installing dependencies: add-determinism x86_64 0.6.0-2.fc43 fedora 2.4 MiB alternatives x86_64 1.33-2.fc43 fedora 62.2 KiB ansible-srpm-macros noarch 1-18.1.fc43 fedora 35.7 KiB audit-libs x86_64 4.1.1-1.fc43 fedora 378.8 KiB binutils x86_64 2.45-1.fc43 fedora 26.5 MiB build-reproducibility-srpm-macros noarch 0.6.0-2.fc43 fedora 735.0 B bzip2-libs x86_64 1.0.8-21.fc43 fedora 80.6 KiB ca-certificates noarch 2024.2.69_v8.0.401-7.fc43 fedora 2.6 MiB coreutils-common x86_64 9.7-5.fc43 fedora 11.3 MiB crypto-policies noarch 20250714-3.gitcd6043a.fc43 fedora 146.9 KiB curl x86_64 8.15.0-2.fc43 fedora 473.6 KiB cyrus-sasl-lib x86_64 2.1.28-33.fc43 fedora 2.3 MiB debugedit x86_64 5.2-2.fc43 fedora 197.8 KiB dwz x86_64 0.16-2.fc43 fedora 287.1 KiB ed x86_64 1.22-2.fc43 fedora 147.1 KiB efi-srpm-macros noarch 6-4.fc43 fedora 40.1 KiB elfutils x86_64 0.193-3.fc43 fedora 2.9 MiB elfutils-debuginfod-client x86_64 0.193-3.fc43 fedora 83.9 KiB elfutils-default-yama-scope noarch 0.193-3.fc43 fedora 1.8 KiB elfutils-libelf x86_64 0.193-3.fc43 fedora 1.2 MiB elfutils-libs x86_64 0.193-3.fc43 fedora 683.4 KiB fedora-gpg-keys noarch 43-0.2 fedora 129.0 KiB fedora-release noarch 43-0.19 fedora 0.0 B fedora-release-identity-basic noarch 43-0.19 fedora 664.0 B fedora-repos noarch 43-0.2 fedora 4.9 KiB fedora-repos-rawhide noarch 43-0.2 fedora 2.2 KiB file x86_64 5.46-6.fc43 fedora 100.2 KiB file-libs x86_64 5.46-6.fc43 fedora 11.9 MiB filesystem x86_64 3.18-50.fc43 fedora 112.0 B filesystem-srpm-macros noarch 3.18-50.fc43 fedora 38.2 KiB fonts-srpm-macros noarch 1:2.0.5-23.fc43 fedora 55.8 KiB forge-srpm-macros noarch 0.4.0-3.fc43 fedora 38.9 KiB fpc-srpm-macros noarch 1.3-15.fc43 fedora 144.0 B gdb-minimal x86_64 16.3-4.fc43 fedora 13.2 MiB gdbm-libs x86_64 1:1.23-10.fc43 fedora 129.9 KiB ghc-srpm-macros noarch 1.9.2-3.fc43 fedora 779.0 B glibc x86_64 2.42-2.fc43 fedora 6.7 MiB glibc-common x86_64 2.42-2.fc43 fedora 1.0 MiB glibc-gconv-extra x86_64 2.42-2.fc43 fedora 7.2 MiB gmp x86_64 1:6.3.0-4.fc43 fedora 811.2 KiB gnat-srpm-macros noarch 6-8.fc43 fedora 1.0 KiB gnupg2 x86_64 2.4.8-4.fc43 fedora 6.5 MiB gnupg2-dirmngr x86_64 2.4.8-4.fc43 fedora 618.4 KiB gnupg2-gpg-agent x86_64 2.4.8-4.fc43 fedora 671.4 KiB gnupg2-gpgconf x86_64 2.4.8-4.fc43 fedora 250.0 KiB gnupg2-keyboxd x86_64 2.4.8-4.fc43 fedora 201.4 KiB gnupg2-verify x86_64 2.4.8-4.fc43 fedora 348.5 KiB gnutls x86_64 3.8.10-3.fc43 fedora 3.8 MiB go-srpm-macros noarch 3.7.0-4.fc43 fedora 61.6 KiB gpgverify noarch 2.2-3.fc43 fedora 8.7 KiB ima-evm-utils-libs x86_64 1.6.2-6.fc43 fedora 60.7 KiB jansson x86_64 2.14-3.fc43 fedora 89.1 KiB java-srpm-macros noarch 1-7.fc43 fedora 870.0 B json-c x86_64 0.18-7.fc43 fedora 82.7 KiB kernel-srpm-macros noarch 1.0-27.fc43 fedora 1.9 KiB keyutils-libs x86_64 1.6.3-6.fc43 fedora 54.3 KiB krb5-libs x86_64 1.21.3-7.fc43 fedora 2.3 MiB libacl x86_64 2.3.2-4.fc43 fedora 35.9 KiB libarchive x86_64 3.8.1-3.fc43 fedora 951.1 KiB libassuan x86_64 2.5.7-4.fc43 fedora 163.8 KiB libattr x86_64 2.5.2-6.fc43 fedora 24.4 KiB libblkid x86_64 2.41.1-15.fc43 fedora 262.4 KiB libbrotli x86_64 1.1.0-8.fc43 fedora 833.3 KiB libcap x86_64 2.76-2.fc43 fedora 209.1 KiB libcap-ng x86_64 0.8.5-6.fc43 fedora 68.9 KiB libcom_err x86_64 1.47.3-2.fc43 fedora 63.1 KiB libcurl x86_64 8.15.0-2.fc43 fedora 903.2 KiB libeconf x86_64 0.7.9-2.fc43 fedora 64.9 KiB libevent x86_64 2.1.12-16.fc43 fedora 883.1 KiB libfdisk x86_64 2.41.1-15.fc43 fedora 380.4 KiB libffi x86_64 3.5.1-2.fc43 fedora 83.6 KiB libfsverity x86_64 1.6-3.fc43 fedora 28.5 KiB libgcc x86_64 15.2.1-1.fc43 copr_base 266.6 KiB libgcrypt x86_64 1.11.1-2.fc43 fedora 1.6 MiB libgomp x86_64 15.2.1-1.fc43 copr_base 541.1 KiB libgpg-error x86_64 1.55-2.fc43 fedora 915.3 KiB libidn2 x86_64 2.3.8-2.fc43 fedora 552.5 KiB libksba x86_64 1.6.7-4.fc43 fedora 398.5 KiB liblastlog2 x86_64 2.41.1-15.fc43 fedora 33.9 KiB libmount x86_64 2.41.1-15.fc43 fedora 372.7 KiB libnghttp2 x86_64 1.66.0-2.fc43 fedora 162.2 KiB libpkgconf x86_64 2.3.0-3.fc43 fedora 78.1 KiB libpsl x86_64 0.21.5-6.fc43 fedora 76.4 KiB libselinux x86_64 3.9-2.fc43 fedora 193.1 KiB libsemanage x86_64 3.9-2.fc43 fedora 308.4 KiB libsepol x86_64 3.9-2.fc43 fedora 822.0 KiB libsmartcols x86_64 2.41.1-15.fc43 fedora 180.5 KiB libssh x86_64 0.11.2-2.fc43 fedora 566.7 KiB libssh-config noarch 0.11.2-2.fc43 fedora 277.0 B libstdc++ x86_64 15.2.1-1.fc43 copr_base 2.8 MiB libtasn1 x86_64 4.20.0-2.fc43 fedora 176.3 KiB libtool-ltdl x86_64 2.5.4-7.fc43 fedora 70.1 KiB libunistring x86_64 1.1-10.fc43 fedora 1.7 MiB libusb1 x86_64 1.0.29-2.fc43 fedora 171.3 KiB libuuid x86_64 2.41.1-15.fc43 fedora 37.4 KiB libverto x86_64 0.3.2-11.fc43 fedora 25.4 KiB libxcrypt x86_64 4.4.38-8.fc43 fedora 284.5 KiB libxml2 x86_64 2.12.10-3.fc43 fedora 1.7 MiB libzstd x86_64 1.5.7-2.fc43 fedora 799.9 KiB lua-libs x86_64 5.4.8-2.fc43 fedora 280.8 KiB lua-srpm-macros noarch 1-16.fc43 fedora 1.3 KiB lz4-libs x86_64 1.10.0-3.fc43 fedora 161.4 KiB mpfr x86_64 4.2.2-2.fc43 fedora 832.8 KiB ncurses-base noarch 6.5-7.20250614.fc43 fedora 328.1 KiB ncurses-libs x86_64 6.5-7.20250614.fc43 fedora 946.3 KiB nettle x86_64 3.10.1-2.fc43 fedora 790.6 KiB npth x86_64 1.8-3.fc43 fedora 49.6 KiB ocaml-srpm-macros noarch 11-2.fc43 fedora 1.9 KiB openblas-srpm-macros noarch 2-20.fc43 fedora 112.0 B openldap x86_64 2.6.10-3.fc43 fedora 655.8 KiB openssl-libs x86_64 1:3.5.1-2.fc43 fedora 8.9 MiB p11-kit x86_64 0.25.5-9.fc43 fedora 2.2 MiB p11-kit-trust x86_64 0.25.5-9.fc43 fedora 395.5 KiB package-notes-srpm-macros noarch 0.5-14.fc43 fedora 1.6 KiB pam-libs x86_64 1.7.1-3.fc43 fedora 126.8 KiB pcre2 x86_64 10.45-1.fc43.1 fedora 697.7 KiB pcre2-syntax noarch 10.45-1.fc43.1 fedora 273.9 KiB perl-srpm-macros noarch 1-60.fc43 fedora 861.0 B pkgconf x86_64 2.3.0-3.fc43 fedora 88.5 KiB pkgconf-m4 noarch 2.3.0-3.fc43 fedora 14.4 KiB pkgconf-pkg-config x86_64 2.3.0-3.fc43 fedora 989.0 B popt x86_64 1.19-9.fc43 fedora 132.8 KiB publicsuffix-list-dafsa noarch 20250616-2.fc43 fedora 69.1 KiB pyproject-srpm-macros noarch 1.18.3-2.fc43 fedora 1.9 KiB python-srpm-macros noarch 3.14-4.fc43 fedora 51.5 KiB qt5-srpm-macros noarch 5.15.17-2.fc43 fedora 500.0 B qt6-srpm-macros noarch 6.9.1-2.fc43 fedora 464.0 B readline x86_64 8.3-2.fc43 fedora 511.7 KiB rpm x86_64 5.99.91-3.fc43 fedora 3.0 MiB rpm-build-libs x86_64 5.99.91-3.fc43 fedora 268.4 KiB rpm-libs x86_64 5.99.91-3.fc43 fedora 933.8 KiB rpm-sequoia x86_64 1.9.0-2.fc43 fedora 2.5 MiB rpm-sign-libs x86_64 5.99.91-3.fc43 fedora 39.7 KiB rust-srpm-macros noarch 26.3-5.fc43 fedora 4.8 KiB setup noarch 2.15.0-26.fc43 fedora 725.0 KiB sqlite-libs x86_64 3.50.2-2.fc43 fedora 1.5 MiB systemd-libs x86_64 257.7-1.fc43 fedora 2.2 MiB systemd-standalone-sysusers x86_64 257.7-1.fc43 fedora 277.3 KiB tpm2-tss x86_64 4.1.3-8.fc43 fedora 1.6 MiB tree-sitter-srpm-macros noarch 0.4.2-1.fc43 fedora 8.3 KiB util-linux-core x86_64 2.41.1-15.fc43 fedora 1.5 MiB xxhash-libs x86_64 0.8.3-3.fc43 fedora 90.2 KiB xz-libs x86_64 1:5.8.1-2.fc43 fedora 217.8 KiB zig-srpm-macros noarch 1-5.fc43 fedora 1.1 KiB zip x86_64 3.0-44.fc43 fedora 694.5 KiB zlib-ng-compat x86_64 2.2.5-1.fc43 fedora 137.6 KiB zstd x86_64 1.5.7-2.fc43 fedora 1.7 MiB Installing groups: Buildsystem building group Transaction Summary: Installing: 169 packages Total size of inbound packages is 59 MiB. Need to download 59 MiB. After this operation, 198 MiB extra will be used (install 198 MiB, remove 0 B). [ 1/169] bzip2-0:1.0.8-21.fc43.x86_64 100% | 333.1 KiB/s | 51.6 KiB | 00m00s [ 2/169] cpio-0:2.15-6.fc43.x86_64 100% | 2.1 MiB/s | 293.1 KiB | 00m00s [ 3/169] coreutils-0:9.7-5.fc43.x86_64 100% | 3.6 MiB/s | 1.1 MiB | 00m00s [ 4/169] bash-0:5.3.0-2.fc43.x86_64 100% | 5.5 MiB/s | 1.9 MiB | 00m00s [ 5/169] fedora-release-common-0:43-0. 100% | 726.3 KiB/s | 25.4 KiB | 00m00s [ 6/169] diffutils-0:3.12-3.fc43.x86_6 100% | 5.0 MiB/s | 392.3 KiB | 00m00s [ 7/169] findutils-1:4.10.0-6.fc43.x86 100% | 12.2 MiB/s | 550.0 KiB | 00m00s [ 8/169] glibc-minimal-langpack-0:2.42 100% | 1.0 MiB/s | 37.1 KiB | 00m00s [ 9/169] grep-0:3.12-2.fc43.x86_64 100% | 6.2 MiB/s | 299.1 KiB | 00m00s [ 10/169] gzip-0:1.13-4.fc43.x86_64 100% | 4.5 MiB/s | 170.1 KiB | 00m00s [ 11/169] info-0:7.2-6.fc43.x86_64 100% | 4.6 MiB/s | 182.9 KiB | 00m00s [ 12/169] patch-0:2.8-2.fc43.x86_64 100% | 2.8 MiB/s | 113.8 KiB | 00m00s [ 13/169] redhat-rpm-config-0:343-10.fc 100% | 2.2 MiB/s | 79.5 KiB | 00m00s [ 14/169] rpm-build-0:5.99.91-3.fc43.x8 100% | 3.5 MiB/s | 135.7 KiB | 00m00s [ 15/169] sed-0:4.9-5.fc43.x86_64 100% | 6.1 MiB/s | 317.1 KiB | 00m00s [ 16/169] shadow-utils-2:4.18.0-3.fc43. 100% | 21.0 MiB/s | 1.3 MiB | 00m00s [ 17/169] tar-2:1.35-6.fc43.x86_64 100% | 14.2 MiB/s | 856.4 KiB | 00m00s [ 18/169] unzip-0:6.0-67.fc43.x86_64 100% | 4.2 MiB/s | 183.7 KiB | 00m00s [ 19/169] which-0:2.23-3.fc43.x86_64 100% | 1.1 MiB/s | 41.7 KiB | 00m00s [ 20/169] xz-1:5.8.1-2.fc43.x86_64 100% | 11.6 MiB/s | 572.5 KiB | 00m00s [ 21/169] util-linux-0:2.41.1-15.fc43.x 100% | 20.5 MiB/s | 1.2 MiB | 00m00s [ 22/169] gawk-0:5.3.2-2.fc43.x86_64 100% | 13.1 MiB/s | 1.1 MiB | 00m00s [ 23/169] filesystem-0:3.18-50.fc43.x86 100% | 18.8 MiB/s | 1.3 MiB | 00m00s [ 24/169] bzip2-libs-0:1.0.8-21.fc43.x8 100% | 1.0 MiB/s | 43.1 KiB | 00m00s [ 25/169] ncurses-libs-0:6.5-7.20250614 100% | 6.5 MiB/s | 332.7 KiB | 00m00s [ 26/169] glibc-0:2.42-2.fc43.x86_64 100% | 29.4 MiB/s | 2.2 MiB | 00m00s [ 27/169] gmp-1:6.3.0-4.fc43.x86_64 100% | 6.8 MiB/s | 319.3 KiB | 00m00s [ 28/169] libacl-0:2.3.2-4.fc43.x86_64 100% | 564.6 KiB/s | 24.3 KiB | 00m00s [ 29/169] coreutils-common-0:9.7-5.fc43 100% | 26.3 MiB/s | 2.1 MiB | 00m00s [ 30/169] libattr-0:2.5.2-6.fc43.x86_64 100% | 510.1 KiB/s | 17.9 KiB | 00m00s [ 31/169] libcap-0:2.76-2.fc43.x86_64 100% | 2.3 MiB/s | 86.9 KiB | 00m00s [ 32/169] libselinux-0:3.9-2.fc43.x86_6 100% | 2.5 MiB/s | 97.2 KiB | 00m00s [ 33/169] fedora-repos-0:43-0.2.noarch 100% | 219.7 KiB/s | 9.2 KiB | 00m00s [ 34/169] openssl-libs-1:3.5.1-2.fc43.x 100% | 23.1 MiB/s | 2.6 MiB | 00m00s [ 35/169] glibc-common-0:2.42-2.fc43.x8 100% | 7.7 MiB/s | 323.9 KiB | 00m00s [ 36/169] systemd-libs-0:257.7-1.fc43.x 100% | 5.8 MiB/s | 789.7 KiB | 00m00s [ 37/169] pcre2-0:10.45-1.fc43.1.x86_64 100% | 6.6 MiB/s | 261.9 KiB | 00m00s [ 38/169] ed-0:1.22-2.fc43.x86_64 100% | 2.1 MiB/s | 83.2 KiB | 00m00s [ 39/169] ansible-srpm-macros-0:1-18.1. 100% | 553.0 KiB/s | 19.9 KiB | 00m00s [ 40/169] build-reproducibility-srpm-ma 100% | 328.4 KiB/s | 11.8 KiB | 00m00s [ 41/169] dwz-0:0.16-2.fc43.x86_64 100% | 3.5 MiB/s | 135.5 KiB | 00m00s [ 42/169] efi-srpm-macros-0:6-4.fc43.no 100% | 622.0 KiB/s | 22.4 KiB | 00m00s [ 43/169] file-0:5.46-6.fc43.x86_64 100% | 1.3 MiB/s | 48.7 KiB | 00m00s [ 44/169] filesystem-srpm-macros-0:3.18 100% | 713.9 KiB/s | 26.4 KiB | 00m00s [ 45/169] fonts-srpm-macros-1:2.0.5-23. 100% | 754.8 KiB/s | 27.2 KiB | 00m00s [ 46/169] forge-srpm-macros-0:0.4.0-3.f 100% | 558.0 KiB/s | 20.1 KiB | 00m00s [ 47/169] fpc-srpm-macros-0:1.3-15.fc43 100% | 213.3 KiB/s | 7.9 KiB | 00m00s [ 48/169] ghc-srpm-macros-0:1.9.2-3.fc4 100% | 249.9 KiB/s | 8.7 KiB | 00m00s [ 49/169] gnat-srpm-macros-0:6-8.fc43.n 100% | 235.7 KiB/s | 8.5 KiB | 00m00s [ 50/169] go-srpm-macros-0:3.7.0-4.fc43 100% | 760.6 KiB/s | 28.1 KiB | 00m00s [ 51/169] java-srpm-macros-0:1-7.fc43.n 100% | 220.6 KiB/s | 7.9 KiB | 00m00s [ 52/169] kernel-srpm-macros-0:1.0-27.f 100% | 247.8 KiB/s | 8.9 KiB | 00m00s [ 53/169] lua-srpm-macros-0:1-16.fc43.n 100% | 236.7 KiB/s | 8.8 KiB | 00m00s [ 54/169] ocaml-srpm-macros-0:11-2.fc43 100% | 257.2 KiB/s | 9.3 KiB | 00m00s [ 55/169] openblas-srpm-macros-0:2-20.f 100% | 211.0 KiB/s | 7.6 KiB | 00m00s [ 56/169] package-notes-srpm-macros-0:0 100% | 242.8 KiB/s | 9.0 KiB | 00m00s [ 57/169] perl-srpm-macros-0:1-60.fc43. 100% | 230.3 KiB/s | 8.3 KiB | 00m00s [ 58/169] pyproject-srpm-macros-0:1.18. 100% | 371.3 KiB/s | 13.4 KiB | 00m00s [ 59/169] python-srpm-macros-0:3.14-4.f 100% | 631.0 KiB/s | 23.3 KiB | 00m00s [ 60/169] qt5-srpm-macros-0:5.15.17-2.f 100% | 247.5 KiB/s | 8.7 KiB | 00m00s [ 61/169] qt6-srpm-macros-0:6.9.1-2.fc4 100% | 259.2 KiB/s | 9.3 KiB | 00m00s [ 62/169] rust-srpm-macros-0:26.3-5.fc4 100% | 310.6 KiB/s | 11.2 KiB | 00m00s [ 63/169] rpm-0:5.99.91-3.fc43.x86_64 100% | 12.5 MiB/s | 562.8 KiB | 00m00s [ 64/169] tree-sitter-srpm-macros-0:0.4 100% | 370.8 KiB/s | 13.4 KiB | 00m00s [ 65/169] zig-srpm-macros-0:1-5.fc43.no 100% | 241.0 KiB/s | 8.4 KiB | 00m00s [ 66/169] zip-0:3.0-44.fc43.x86_64 100% | 6.4 MiB/s | 261.6 KiB | 00m00s [ 67/169] debugedit-0:5.2-2.fc43.x86_64 100% | 2.2 MiB/s | 80.6 KiB | 00m00s [ 68/169] elfutils-0:0.193-3.fc43.x86_6 100% | 12.1 MiB/s | 571.3 KiB | 00m00s [ 69/169] elfutils-libelf-0:0.193-3.fc4 100% | 4.9 MiB/s | 207.8 KiB | 00m00s [ 70/169] libarchive-0:3.8.1-3.fc43.x86 100% | 9.6 MiB/s | 421.1 KiB | 00m00s [ 71/169] popt-0:1.19-9.fc43.x86_64 100% | 1.8 MiB/s | 65.7 KiB | 00m00s [ 72/169] readline-0:8.3-2.fc43.x86_64 100% | 5.3 MiB/s | 224.6 KiB | 00m00s [ 73/169] rpm-build-libs-0:5.99.91-3.fc 100% | 3.3 MiB/s | 128.3 KiB | 00m00s [ 74/169] rpm-libs-0:5.99.91-3.fc43.x86 100% | 8.9 MiB/s | 400.3 KiB | 00m00s [ 75/169] zstd-0:1.5.7-2.fc43.x86_64 100% | 11.0 MiB/s | 485.9 KiB | 00m00s [ 76/169] audit-libs-0:4.1.1-1.fc43.x86 100% | 3.6 MiB/s | 138.5 KiB | 00m00s [ 77/169] libeconf-0:0.7.9-2.fc43.x86_6 100% | 978.1 KiB/s | 35.2 KiB | 00m00s [ 78/169] libsemanage-0:3.9-2.fc43.x86_ 100% | 3.1 MiB/s | 123.4 KiB | 00m00s [ 79/169] libxcrypt-0:4.4.38-8.fc43.x86 100% | 3.3 MiB/s | 127.0 KiB | 00m00s [ 80/169] pam-libs-0:1.7.1-3.fc43.x86_6 100% | 1.5 MiB/s | 57.5 KiB | 00m00s [ 81/169] setup-0:2.15.0-26.fc43.noarch 100% | 3.9 MiB/s | 157.3 KiB | 00m00s [ 82/169] xz-libs-1:5.8.1-2.fc43.x86_64 100% | 3.0 MiB/s | 112.9 KiB | 00m00s [ 83/169] mpfr-0:4.2.2-2.fc43.x86_64 100% | 8.1 MiB/s | 347.0 KiB | 00m00s [ 84/169] libblkid-0:2.41.1-15.fc43.x86 100% | 3.2 MiB/s | 122.9 KiB | 00m00s [ 85/169] libcap-ng-0:0.8.5-6.fc43.x86_ 100% | 891.2 KiB/s | 32.1 KiB | 00m00s [ 86/169] libfdisk-0:2.41.1-15.fc43.x86 100% | 4.1 MiB/s | 161.1 KiB | 00m00s [ 87/169] liblastlog2-0:2.41.1-15.fc43. 100% | 622.2 KiB/s | 23.0 KiB | 00m00s [ 88/169] libmount-0:2.41.1-15.fc43.x86 100% | 4.2 MiB/s | 162.5 KiB | 00m00s [ 89/169] libsmartcols-0:2.41.1-15.fc43 100% | 2.2 MiB/s | 83.7 KiB | 00m00s [ 90/169] libuuid-0:2.41.1-15.fc43.x86_ 100% | 701.3 KiB/s | 25.9 KiB | 00m00s [ 91/169] util-linux-core-0:2.41.1-15.f 100% | 12.0 MiB/s | 551.0 KiB | 00m00s [ 92/169] zlib-ng-compat-0:2.2.5-1.fc43 100% | 2.1 MiB/s | 79.2 KiB | 00m00s [ 93/169] ncurses-base-0:6.5-7.20250614 100% | 1.9 MiB/s | 88.2 KiB | 00m00s [ 94/169] libsepol-0:3.9-2.fc43.x86_64 100% | 8.2 MiB/s | 345.4 KiB | 00m00s [ 95/169] ca-certificates-0:2024.2.69_v 100% | 17.1 MiB/s | 944.0 KiB | 00m00s [ 96/169] crypto-policies-0:20250714-3. 100% | 2.4 MiB/s | 98.2 KiB | 00m00s [ 97/169] fedora-gpg-keys-0:43-0.2.noar 100% | 3.6 MiB/s | 136.6 KiB | 00m00s [ 98/169] fedora-repos-rawhide-0:43-0.2 100% | 244.5 KiB/s | 8.8 KiB | 00m00s [ 99/169] pcre2-syntax-0:10.45-1.fc43.1 100% | 4.0 MiB/s | 161.7 KiB | 00m00s [100/169] add-determinism-0:0.6.0-2.fc4 100% | 15.8 MiB/s | 919.3 KiB | 00m00s [101/169] file-libs-0:5.46-6.fc43.x86_6 100% | 16.6 MiB/s | 849.5 KiB | 00m00s [102/169] curl-0:8.15.0-2.fc43.x86_64 100% | 5.6 MiB/s | 233.7 KiB | 00m00s [103/169] elfutils-libs-0:0.193-3.fc43. 100% | 6.6 MiB/s | 269.7 KiB | 00m00s [104/169] elfutils-debuginfod-client-0: 100% | 1.3 MiB/s | 46.8 KiB | 00m00s [105/169] libzstd-0:1.5.7-2.fc43.x86_64 100% | 7.3 MiB/s | 314.6 KiB | 00m00s [106/169] libxml2-0:2.12.10-3.fc43.x86_ 100% | 13.2 MiB/s | 691.9 KiB | 00m00s [107/169] glibc-gconv-extra-0:2.42-2.fc 100% | 4.5 MiB/s | 1.6 MiB | 00m00s [108/169] lz4-libs-0:1.10.0-3.fc43.x86_ 100% | 2.1 MiB/s | 78.0 KiB | 00m00s [109/169] lua-libs-0:5.4.8-2.fc43.x86_6 100% | 3.3 MiB/s | 131.7 KiB | 00m00s [110/169] rpm-sequoia-0:1.9.0-2.fc43.x8 100% | 17.9 MiB/s | 933.3 KiB | 00m00s [111/169] sqlite-libs-0:3.50.2-2.fc43.x 100% | 14.0 MiB/s | 760.5 KiB | 00m00s [112/169] elfutils-default-yama-scope-0 100% | 355.0 KiB/s | 12.4 KiB | 00m00s [113/169] json-c-0:0.18-7.fc43.x86_64 100% | 1.2 MiB/s | 45.0 KiB | 00m00s [114/169] ima-evm-utils-libs-0:1.6.2-6. 100% | 575.0 KiB/s | 29.3 KiB | 00m00s [115/169] rpm-sign-libs-0:5.99.91-3.fc4 100% | 140.7 KiB/s | 28.1 KiB | 00m00s [116/169] libfsverity-0:1.6-3.fc43.x86_ 100% | 517.4 KiB/s | 18.6 KiB | 00m00s [117/169] gnupg2-0:2.4.8-4.fc43.x86_64 100% | 11.7 MiB/s | 1.6 MiB | 00m00s [118/169] gnupg2-dirmngr-0:2.4.8-4.fc43 100% | 6.4 MiB/s | 274.6 KiB | 00m00s [119/169] gnupg2-gpg-agent-0:2.4.8-4.fc 100% | 6.7 MiB/s | 272.9 KiB | 00m00s [120/169] gnupg2-gpgconf-0:2.4.8-4.fc43 100% | 3.0 MiB/s | 115.0 KiB | 00m00s [121/169] gpgverify-0:2.2-3.fc43.noarch 100% | 127.6 KiB/s | 11.1 KiB | 00m00s [122/169] gnupg2-keyboxd-0:2.4.8-4.fc43 100% | 2.5 MiB/s | 94.7 KiB | 00m00s [123/169] libassuan-0:2.5.7-4.fc43.x86_ 100% | 1.8 MiB/s | 67.4 KiB | 00m00s [124/169] libgcrypt-0:1.11.1-2.fc43.x86 100% | 12.4 MiB/s | 595.8 KiB | 00m00s [125/169] libgpg-error-0:1.55-2.fc43.x8 100% | 5.2 MiB/s | 244.3 KiB | 00m00s [126/169] npth-0:1.8-3.fc43.x86_64 100% | 712.7 KiB/s | 25.7 KiB | 00m00s [127/169] tpm2-tss-0:4.1.3-8.fc43.x86_6 100% | 9.0 MiB/s | 425.9 KiB | 00m00s [128/169] gnutls-0:3.8.10-3.fc43.x86_64 100% | 23.0 MiB/s | 1.4 MiB | 00m00s [129/169] libksba-0:1.6.7-4.fc43.x86_64 100% | 3.3 MiB/s | 160.4 KiB | 00m00s [130/169] libusb1-0:1.0.29-2.fc43.x86_6 100% | 2.0 MiB/s | 79.7 KiB | 00m00s [131/169] openldap-0:2.6.10-3.fc43.x86_ 100% | 6.2 MiB/s | 259.1 KiB | 00m00s [132/169] libtasn1-0:4.20.0-2.fc43.x86_ 100% | 2.0 MiB/s | 74.5 KiB | 00m00s [133/169] libidn2-0:2.3.8-2.fc43.x86_64 100% | 4.3 MiB/s | 174.9 KiB | 00m00s [134/169] libunistring-0:1.1-10.fc43.x8 100% | 11.0 MiB/s | 542.9 KiB | 00m00s [135/169] nettle-0:3.10.1-2.fc43.x86_64 100% | 8.3 MiB/s | 424.2 KiB | 00m00s [136/169] p11-kit-0:0.25.5-9.fc43.x86_6 100% | 10.8 MiB/s | 488.1 KiB | 00m00s [137/169] cyrus-sasl-lib-0:2.1.28-33.fc 100% | 13.5 MiB/s | 787.9 KiB | 00m00s [138/169] libevent-0:2.1.12-16.fc43.x86 100% | 6.1 MiB/s | 257.8 KiB | 00m00s [139/169] libtool-ltdl-0:2.5.4-7.fc43.x 100% | 1.0 MiB/s | 36.2 KiB | 00m00s [140/169] libffi-0:3.5.1-2.fc43.x86_64 100% | 1.1 MiB/s | 40.9 KiB | 00m00s [141/169] gnupg2-verify-0:2.4.8-4.fc43. 100% | 382.0 KiB/s | 171.2 KiB | 00m00s [142/169] gdbm-libs-1:1.23-10.fc43.x86_ 100% | 1.2 MiB/s | 56.8 KiB | 00m00s [143/169] libgcc-0:15.2.1-1.fc43.x86_64 100% | 2.7 MiB/s | 132.2 KiB | 00m00s [144/169] alternatives-0:1.33-2.fc43.x8 100% | 410.9 KiB/s | 40.7 KiB | 00m00s [145/169] binutils-0:2.45-1.fc43.x86_64 100% | 43.2 MiB/s | 5.9 MiB | 00m00s [146/169] jansson-0:2.14-3.fc43.x86_64 100% | 1.2 MiB/s | 45.3 KiB | 00m00s [147/169] pkgconf-pkg-config-0:2.3.0-3. 100% | 274.5 KiB/s | 9.6 KiB | 00m00s [148/169] pkgconf-0:2.3.0-3.fc43.x86_64 100% | 1.2 MiB/s | 44.6 KiB | 00m00s [149/169] pkgconf-m4-0:2.3.0-3.fc43.noa 100% | 397.5 KiB/s | 13.9 KiB | 00m00s [150/169] p11-kit-trust-0:0.25.5-9.fc43 100% | 609.8 KiB/s | 132.3 KiB | 00m00s [151/169] libpkgconf-0:2.3.0-3.fc43.x86 100% | 1.0 MiB/s | 37.9 KiB | 00m00s [152/169] libstdc++-0:15.2.1-1.fc43.x86 100% | 23.0 MiB/s | 919.4 KiB | 00m00s [153/169] libgomp-0:15.2.1-1.fc43.x86_6 100% | 8.9 MiB/s | 372.1 KiB | 00m00s [154/169] fedora-release-0:43-0.19.noar 100% | 398.0 KiB/s | 14.3 KiB | 00m00s [155/169] xxhash-libs-0:0.8.3-3.fc43.x8 100% | 938.7 KiB/s | 38.5 KiB | 00m00s [156/169] fedora-release-identity-basic 100% | 368.3 KiB/s | 15.1 KiB | 00m00s [157/169] gdb-minimal-0:16.3-4.fc43.x86 100% | 35.6 MiB/s | 4.4 MiB | 00m00s [158/169] libcurl-0:8.15.0-2.fc43.x86_6 100% | 9.9 MiB/s | 404.3 KiB | 00m00s [159/169] krb5-libs-0:1.21.3-7.fc43.x86 100% | 16.1 MiB/s | 758.9 KiB | 00m00s [160/169] systemd-standalone-sysusers-0 100% | 728.6 KiB/s | 134.8 KiB | 00m00s [161/169] libbrotli-0:1.1.0-8.fc43.x86_ 100% | 8.3 MiB/s | 339.0 KiB | 00m00s [162/169] libnghttp2-0:1.66.0-2.fc43.x8 100% | 2.0 MiB/s | 72.5 KiB | 00m00s [163/169] libssh-0:0.11.2-2.fc43.x86_64 100% | 6.0 MiB/s | 232.7 KiB | 00m00s [164/169] keyutils-libs-0:1.6.3-6.fc43. 100% | 870.9 KiB/s | 31.4 KiB | 00m00s [165/169] libcom_err-0:1.47.3-2.fc43.x8 100% | 765.5 KiB/s | 26.8 KiB | 00m00s [166/169] libpsl-0:0.21.5-6.fc43.x86_64 100% | 676.7 KiB/s | 65.0 KiB | 00m00s [167/169] libverto-0:0.3.2-11.fc43.x86_ 100% | 558.8 KiB/s | 20.7 KiB | 00m00s [168/169] publicsuffix-list-dafsa-0:202 100% | 1.6 MiB/s | 59.2 KiB | 00m00s [169/169] libssh-config-0:0.11.2-2.fc43 100% | 205.8 KiB/s | 8.9 KiB | 00m00s -------------------------------------------------------------------------------- [169/169] Total 100% | 17.9 MiB/s | 58.8 MiB | 00m03s Running transaction Importing OpenPGP key 0x31645531: UserID : "Fedora (43) " Fingerprint: C6E7F081CF80E13146676E88829B606631645531 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-43-primary The key was successfully imported. Importing OpenPGP key 0x31645531: UserID : "Fedora (43) " Fingerprint: C6E7F081CF80E13146676E88829B606631645531 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-43-primary The key was successfully imported. Importing OpenPGP key 0x105EF944: UserID : "Fedora (42) " Fingerprint: B0F4950458F69E1150C6C5EDC8AC4916105EF944 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-42-primary The key was successfully imported. Importing OpenPGP key 0x6D9F90A6: UserID : "Fedora (44) " Fingerprint: 36F612DCF27F7D1A48A835E4DBFCF71C6D9F90A6 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-44-primary The key was successfully imported. [ 1/171] Verify package files 100% | 555.0 B/s | 169.0 B | 00m00s >>> Running %pretrans scriptlet: filesystem-0:3.18-50.fc43.x86_64 >>> Finished %pretrans scriptlet: filesystem-0:3.18-50.fc43.x86_64 >>> [RPM] /var/lib/mock/fedora-rawhide-x86_64-1754764510.102582/root/var/cache/d [ 2/171] Prepare transaction 100% | 1.9 KiB/s | 169.0 B | 00m00s [ 3/171] Installing libgcc-0:15.2.1-1. 100% | 87.3 MiB/s | 268.3 KiB | 00m00s [ 4/171] Installing libssh-config-0:0. 100% | 796.9 KiB/s | 816.0 B | 00m00s [ 5/171] Installing publicsuffix-list- 100% | 68.2 MiB/s | 69.8 KiB | 00m00s [ 6/171] Installing fedora-release-ide 100% | 898.4 KiB/s | 920.0 B | 00m00s [ 7/171] Installing fedora-gpg-keys-0: 100% | 15.6 MiB/s | 175.9 KiB | 00m00s [ 8/171] Installing fedora-repos-rawhi 100% | 2.4 MiB/s | 2.4 KiB | 00m00s [ 9/171] Installing fedora-repos-0:43- 100% | 5.6 MiB/s | 5.7 KiB | 00m00s [ 10/171] Installing fedora-release-com 100% | 12.1 MiB/s | 24.8 KiB | 00m00s [ 11/171] Installing fedora-release-0:4 100% | 5.0 KiB/s | 124.0 B | 00m00s >>> Running sysusers scriptlet: setup-0:2.15.0-26.fc43.noarch >>> Finished sysusers scriptlet: setup-0:2.15.0-26.fc43.noarch >>> Scriptlet output: >>> Creating group 'adm' with GID 4. >>> Creating group 'audio' with GID 63. >>> Creating group 'cdrom' with GID 11. >>> Creating group 'clock' with GID 103. >>> Creating group 'dialout' with GID 18. >>> Creating group 'disk' with GID 6. >>> Creating group 'floppy' with GID 19. >>> Creating group 'ftp' with GID 50. >>> Creating group 'games' with GID 20. >>> Creating group 'input' with GID 104. >>> Creating group 'kmem' with GID 9. >>> Creating group 'kvm' with GID 36. >>> Creating group 'lock' with GID 54. >>> Creating group 'lp' with GID 7. >>> Creating group 'mail' with GID 12. >>> Creating group 'man' with GID 15. >>> Creating group 'mem' with GID 8. >>> Creating group 'nobody' with GID 65534. >>> Creating group 'render' with GID 105. >>> Creating group 'root' with GID 0. >>> Creating group 'sgx' with GID 106. >>> Creating group 'sys' with GID 3. >>> Creating group 'tape' with GID 33. >>> Creating group 'tty' with GID 5. >>> Creating group 'users' with GID 100. >>> Creating group 'utmp' with GID 22. >>> Creating group 'video' with GID 39. >>> Creating group 'wheel' with GID 10. >>> Creating user 'adm' (adm) with UID 3 and GID 4. >>> Creating group 'bin' with GID 1. >>> Creating user 'bin' (bin) with UID 1 and GID 1. >>> Creating group 'daemon' with GID 2. >>> Creating user 'daemon' (daemon) with UID 2 and GID 2. >>> Creating user 'ftp' (FTP User) with UID 14 and GID 50. >>> Creating user 'games' (games) with UID 12 and GID 100. >>> Creating user 'halt' (halt) with UID 7 and GID 0. >>> Creating user 'lp' (lp) with UID 4 and GID 7. >>> Creating user 'mail' (mail) with UID 8 and GID 12. >>> Creating user 'nobody' (Kernel Overflow User) with UID 65534 and GID 65534. >>> Creating user 'operator' (operator) with UID 11 and GID 0. >>> Creating user 'root' (Super User) with UID 0 and GID 0. >>> Creating user 'shutdown' (shutdown) with UID 6 and GID 0. >>> Creating user 'sync' (sync) with UID 5 and GID 0. >>> [ 12/171] Installing setup-0:2.15.0-26. 100% | 35.7 MiB/s | 730.6 KiB | 00m00s >>> [RPM] /etc/hosts created as /etc/hosts.rpmnew [ 13/171] Installing filesystem-0:3.18- 100% | 1.4 MiB/s | 212.8 KiB | 00m00s [ 14/171] Installing pkgconf-m4-0:2.3.0 100% | 14.5 MiB/s | 14.8 KiB | 00m00s [ 15/171] Installing pcre2-syntax-0:10. 100% | 135.0 MiB/s | 276.4 KiB | 00m00s [ 16/171] Installing ncurses-base-0:6.5 100% | 34.5 MiB/s | 353.5 KiB | 00m00s [ 17/171] Installing bash-0:5.3.0-2.fc4 100% | 175.6 MiB/s | 8.4 MiB | 00m00s [ 18/171] Installing glibc-common-0:2.4 100% | 48.6 MiB/s | 1.0 MiB | 00m00s [ 19/171] Installing glibc-gconv-extra- 100% | 137.9 MiB/s | 7.3 MiB | 00m00s [ 20/171] Installing glibc-0:2.42-2.fc4 100% | 124.1 MiB/s | 6.7 MiB | 00m00s [ 21/171] Installing ncurses-libs-0:6.5 100% | 155.1 MiB/s | 952.8 KiB | 00m00s [ 22/171] Installing glibc-minimal-lang 100% | 0.0 B/s | 124.0 B | 00m00s [ 23/171] Installing zlib-ng-compat-0:2 100% | 135.2 MiB/s | 138.4 KiB | 00m00s [ 24/171] Installing bzip2-libs-0:1.0.8 100% | 79.8 MiB/s | 81.7 KiB | 00m00s [ 25/171] Installing libgpg-error-0:1.5 100% | 47.3 MiB/s | 921.1 KiB | 00m00s [ 26/171] Installing libstdc++-0:15.2.1 100% | 237.0 MiB/s | 2.8 MiB | 00m00s [ 27/171] Installing xz-libs-1:5.8.1-2. 100% | 106.9 MiB/s | 218.9 KiB | 00m00s [ 28/171] Installing libassuan-0:2.5.7- 100% | 80.9 MiB/s | 165.6 KiB | 00m00s [ 29/171] Installing libgcrypt-0:1.11.1 100% | 225.0 MiB/s | 1.6 MiB | 00m00s [ 30/171] Installing readline-0:8.3-2.f 100% | 125.5 MiB/s | 513.9 KiB | 00m00s [ 31/171] Installing gmp-1:6.3.0-4.fc43 100% | 158.9 MiB/s | 813.5 KiB | 00m00s [ 32/171] Installing libuuid-0:2.41.1-1 100% | 37.4 MiB/s | 38.3 KiB | 00m00s [ 33/171] Installing popt-0:1.19-9.fc43 100% | 27.2 MiB/s | 139.4 KiB | 00m00s [ 34/171] Installing npth-0:1.8-3.fc43. 100% | 49.5 MiB/s | 50.7 KiB | 00m00s [ 35/171] Installing libblkid-0:2.41.1- 100% | 128.7 MiB/s | 263.5 KiB | 00m00s [ 36/171] Installing libxcrypt-0:4.4.38 100% | 93.5 MiB/s | 287.2 KiB | 00m00s [ 37/171] Installing libzstd-0:1.5.7-2. 100% | 195.6 MiB/s | 801.1 KiB | 00m00s [ 38/171] Installing elfutils-libelf-0: 100% | 233.3 MiB/s | 1.2 MiB | 00m00s [ 39/171] Installing sqlite-libs-0:3.50 100% | 216.6 MiB/s | 1.5 MiB | 00m00s [ 40/171] Installing gnupg2-gpgconf-0:2 100% | 17.6 MiB/s | 252.0 KiB | 00m00s [ 41/171] Installing libattr-0:2.5.2-6. 100% | 24.8 MiB/s | 25.4 KiB | 00m00s [ 42/171] Installing libacl-0:2.3.2-4.f 100% | 35.9 MiB/s | 36.8 KiB | 00m00s [ 43/171] Installing libtasn1-0:4.20.0- 100% | 87.0 MiB/s | 178.1 KiB | 00m00s [ 44/171] Installing libunistring-0:1.1 100% | 215.8 MiB/s | 1.7 MiB | 00m00s [ 45/171] Installing libidn2-0:2.3.8-2. 100% | 19.5 MiB/s | 558.7 KiB | 00m00s [ 46/171] Installing crypto-policies-0: 100% | 14.0 MiB/s | 172.0 KiB | 00m00s [ 47/171] Installing dwz-0:0.16-2.fc43. 100% | 18.8 MiB/s | 288.5 KiB | 00m00s [ 48/171] Installing gnupg2-verify-0:2. 100% | 22.8 MiB/s | 349.9 KiB | 00m00s [ 49/171] Installing mpfr-0:4.2.2-2.fc4 100% | 163.0 MiB/s | 834.4 KiB | 00m00s [ 50/171] Installing gawk-0:5.3.2-2.fc4 100% | 72.6 MiB/s | 1.8 MiB | 00m00s [ 51/171] Installing libksba-0:1.6.7-4. 100% | 130.6 MiB/s | 401.1 KiB | 00m00s [ 52/171] Installing unzip-0:6.0-67.fc4 100% | 25.4 MiB/s | 389.8 KiB | 00m00s [ 53/171] Installing file-libs-0:5.46-6 100% | 423.5 MiB/s | 11.9 MiB | 00m00s [ 54/171] Installing file-0:5.46-6.fc43 100% | 7.6 MiB/s | 101.7 KiB | 00m00s [ 55/171] Installing pcre2-0:10.45-1.fc 100% | 170.7 MiB/s | 699.1 KiB | 00m00s [ 56/171] Installing grep-0:3.12-2.fc43 100% | 47.7 MiB/s | 1.0 MiB | 00m00s [ 57/171] Installing xz-1:5.8.1-2.fc43. 100% | 55.5 MiB/s | 1.3 MiB | 00m00s [ 58/171] Installing libeconf-0:0.7.9-2 100% | 65.0 MiB/s | 66.5 KiB | 00m00s [ 59/171] Installing libcap-ng-0:0.8.5- 100% | 69.2 MiB/s | 70.8 KiB | 00m00s [ 60/171] Installing audit-libs-0:4.1.1 100% | 124.2 MiB/s | 381.5 KiB | 00m00s [ 61/171] Installing pam-libs-0:1.7.1-3 100% | 63.0 MiB/s | 129.0 KiB | 00m00s [ 62/171] Installing libcap-0:2.76-2.fc 100% | 13.1 MiB/s | 214.3 KiB | 00m00s [ 63/171] Installing systemd-libs-0:257 100% | 202.9 MiB/s | 2.2 MiB | 00m00s [ 64/171] Installing libsmartcols-0:2.4 100% | 177.2 MiB/s | 181.5 KiB | 00m00s [ 65/171] Installing libsepol-0:3.9-2.f 100% | 200.9 MiB/s | 822.9 KiB | 00m00s [ 66/171] Installing libselinux-0:3.9-2 100% | 94.9 MiB/s | 194.4 KiB | 00m00s [ 67/171] Installing findutils-1:4.10.0 100% | 80.8 MiB/s | 1.9 MiB | 00m00s [ 68/171] Installing sed-0:4.9-5.fc43.x 100% | 42.3 MiB/s | 865.5 KiB | 00m00s [ 69/171] Installing libmount-0:2.41.1- 100% | 121.7 MiB/s | 373.8 KiB | 00m00s [ 70/171] Installing lz4-libs-0:1.10.0- 100% | 79.3 MiB/s | 162.5 KiB | 00m00s [ 71/171] Installing lua-libs-0:5.4.8-2 100% | 137.6 MiB/s | 281.9 KiB | 00m00s [ 72/171] Installing json-c-0:0.18-7.fc 100% | 82.0 MiB/s | 84.0 KiB | 00m00s [ 73/171] Installing libffi-0:3.5.1-2.f 100% | 41.5 MiB/s | 85.0 KiB | 00m00s [ 74/171] Installing p11-kit-0:0.25.5-9 100% | 75.4 MiB/s | 2.2 MiB | 00m00s [ 75/171] Installing alternatives-0:1.3 100% | 4.8 MiB/s | 63.8 KiB | 00m00s [ 76/171] Installing p11-kit-trust-0:0. 100% | 12.5 MiB/s | 397.1 KiB | 00m00s [ 77/171] Installing zstd-0:1.5.7-2.fc4 100% | 77.7 MiB/s | 1.7 MiB | 00m00s [ 78/171] Installing util-linux-core-0: 100% | 59.2 MiB/s | 1.5 MiB | 00m00s [ 79/171] Installing tar-2:1.35-6.fc43. 100% | 109.6 MiB/s | 3.0 MiB | 00m00s [ 80/171] Installing libsemanage-0:3.9- 100% | 101.0 MiB/s | 310.2 KiB | 00m00s [ 81/171] Installing systemd-standalone 100% | 19.4 MiB/s | 277.8 KiB | 00m00s [ 82/171] Installing libusb1-0:1.0.29-2 100% | 84.4 MiB/s | 172.9 KiB | 00m00s [ 83/171] Installing zip-0:3.0-44.fc43. 100% | 42.6 MiB/s | 698.4 KiB | 00m00s [ 84/171] Installing gnupg2-keyboxd-0:2 100% | 13.2 MiB/s | 202.7 KiB | 00m00s [ 85/171] Installing libpsl-0:0.21.5-6. 100% | 75.7 MiB/s | 77.5 KiB | 00m00s [ 86/171] Installing liblastlog2-0:2.41 100% | 2.3 MiB/s | 35.9 KiB | 00m00s [ 87/171] Installing libfdisk-0:2.41.1- 100% | 124.2 MiB/s | 381.5 KiB | 00m00s [ 88/171] Installing nettle-0:3.10.1-2. 100% | 155.0 MiB/s | 793.7 KiB | 00m00s [ 89/171] Installing gnutls-0:3.8.10-3. 100% | 225.8 MiB/s | 3.8 MiB | 00m00s [ 90/171] Installing libxml2-0:2.12.10- 100% | 77.5 MiB/s | 1.7 MiB | 00m00s [ 91/171] Installing bzip2-0:1.0.8-21.f 100% | 6.5 MiB/s | 99.8 KiB | 00m00s [ 92/171] Installing add-determinism-0: 100% | 111.1 MiB/s | 2.4 MiB | 00m00s [ 93/171] Installing build-reproducibil 100% | 1.0 MiB/s | 1.0 KiB | 00m00s [ 94/171] Installing cpio-0:2.15-6.fc43 100% | 55.0 MiB/s | 1.1 MiB | 00m00s [ 95/171] Installing diffutils-0:3.12-3 100% | 71.0 MiB/s | 1.6 MiB | 00m00s [ 96/171] Installing ed-0:1.22-2.fc43.x 100% | 10.4 MiB/s | 149.4 KiB | 00m00s [ 97/171] Installing patch-0:2.8-2.fc43 100% | 14.6 MiB/s | 224.3 KiB | 00m00s [ 98/171] Installing libtool-ltdl-0:2.5 100% | 69.6 MiB/s | 71.2 KiB | 00m00s [ 99/171] Installing gdbm-libs-1:1.23-1 100% | 64.2 MiB/s | 131.6 KiB | 00m00s [100/171] Installing cyrus-sasl-lib-0:2 100% | 95.7 MiB/s | 2.3 MiB | 00m00s [101/171] Installing jansson-0:2.14-3.f 100% | 88.3 MiB/s | 90.5 KiB | 00m00s [102/171] Installing libpkgconf-0:2.3.0 100% | 77.4 MiB/s | 79.2 KiB | 00m00s [103/171] Installing pkgconf-0:2.3.0-3. 100% | 6.3 MiB/s | 91.0 KiB | 00m00s [104/171] Installing pkgconf-pkg-config 100% | 147.8 KiB/s | 1.8 KiB | 00m00s [105/171] Installing libgomp-0:15.2.1-1 100% | 176.6 MiB/s | 542.5 KiB | 00m00s [106/171] Installing xxhash-libs-0:0.8. 100% | 89.4 MiB/s | 91.6 KiB | 00m00s [107/171] Installing libbrotli-0:1.1.0- 100% | 204.0 MiB/s | 835.6 KiB | 00m00s [108/171] Installing libnghttp2-0:1.66. 100% | 159.5 MiB/s | 163.3 KiB | 00m00s [109/171] Installing keyutils-libs-0:1. 100% | 54.4 MiB/s | 55.7 KiB | 00m00s [110/171] Installing libcom_err-0:1.47. 100% | 62.7 MiB/s | 64.2 KiB | 00m00s [111/171] Installing libverto-0:0.3.2-1 100% | 26.6 MiB/s | 27.2 KiB | 00m00s [112/171] Installing filesystem-srpm-ma 100% | 38.0 MiB/s | 38.9 KiB | 00m00s [113/171] Installing elfutils-default-y 100% | 127.7 KiB/s | 2.0 KiB | 00m00s [114/171] Installing elfutils-libs-0:0. 100% | 133.8 MiB/s | 685.2 KiB | 00m00s [115/171] Installing rust-srpm-macros-0 100% | 5.4 MiB/s | 5.6 KiB | 00m00s [116/171] Installing qt6-srpm-macros-0: 100% | 722.7 KiB/s | 740.0 B | 00m00s [117/171] Installing qt5-srpm-macros-0: 100% | 0.0 B/s | 776.0 B | 00m00s [118/171] Installing perl-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [119/171] Installing package-notes-srpm 100% | 2.0 MiB/s | 2.0 KiB | 00m00s [120/171] Installing openblas-srpm-macr 100% | 0.0 B/s | 392.0 B | 00m00s [121/171] Installing ocaml-srpm-macros- 100% | 0.0 B/s | 2.1 KiB | 00m00s [122/171] Installing kernel-srpm-macros 100% | 2.3 MiB/s | 2.3 KiB | 00m00s [123/171] Installing gnat-srpm-macros-0 100% | 1.2 MiB/s | 1.3 KiB | 00m00s [124/171] Installing ghc-srpm-macros-0: 100% | 0.0 B/s | 1.0 KiB | 00m00s [125/171] Installing fpc-srpm-macros-0: 100% | 0.0 B/s | 420.0 B | 00m00s [126/171] Installing ansible-srpm-macro 100% | 35.4 MiB/s | 36.2 KiB | 00m00s [127/171] Installing coreutils-common-0 100% | 205.3 MiB/s | 11.3 MiB | 00m00s [128/171] Installing openssl-libs-1:3.5 100% | 247.3 MiB/s | 8.9 MiB | 00m00s [129/171] Installing coreutils-0:9.7-5. 100% | 99.0 MiB/s | 5.4 MiB | 00m00s [130/171] Installing ca-certificates-0: 100% | 1.1 MiB/s | 2.4 MiB | 00m02s [131/171] Installing libarchive-0:3.8.1 100% | 155.1 MiB/s | 953.1 KiB | 00m00s [132/171] Installing krb5-libs-0:1.21.3 100% | 79.0 MiB/s | 2.3 MiB | 00m00s >>> Running sysusers scriptlet: tpm2-tss-0:4.1.3-8.fc43.x86_64 >>> Finished sysusers scriptlet: tpm2-tss-0:4.1.3-8.fc43.x86_64 >>> Scriptlet output: >>> Creating group 'tss' with GID 59. >>> Creating user 'tss' (Account used for TPM access) with UID 59 and GID 59. >>> [133/171] Installing tpm2-tss-0:4.1.3-8 100% | 98.3 MiB/s | 1.6 MiB | 00m00s [134/171] Installing ima-evm-utils-libs 100% | 60.5 MiB/s | 62.0 KiB | 00m00s [135/171] Installing gnupg2-gpg-agent-0 100% | 21.3 MiB/s | 675.4 KiB | 00m00s [136/171] Installing libssh-0:0.11.2-2. 100% | 138.9 MiB/s | 568.7 KiB | 00m00s [137/171] Installing gzip-0:1.13-4.fc43 100% | 24.1 MiB/s | 394.4 KiB | 00m00s [138/171] Installing rpm-sequoia-0:1.9. 100% | 247.8 MiB/s | 2.5 MiB | 00m00s [139/171] Installing rpm-libs-0:5.99.91 100% | 182.7 MiB/s | 935.3 KiB | 00m00s [140/171] Installing libfsverity-0:1.6- 100% | 28.8 MiB/s | 29.5 KiB | 00m00s [141/171] Installing libevent-0:2.1.12- 100% | 173.2 MiB/s | 886.8 KiB | 00m00s [142/171] Installing openldap-0:2.6.10- 100% | 128.8 MiB/s | 659.6 KiB | 00m00s [143/171] Installing libcurl-0:8.15.0-2 100% | 220.8 MiB/s | 904.3 KiB | 00m00s [144/171] Installing elfutils-debuginfo 100% | 5.6 MiB/s | 86.2 KiB | 00m00s [145/171] Installing elfutils-0:0.193-3 100% | 121.6 MiB/s | 2.9 MiB | 00m00s [146/171] Installing binutils-0:2.45-1. 100% | 217.5 MiB/s | 26.5 MiB | 00m00s [147/171] Installing gdb-minimal-0:16.3 100% | 232.4 MiB/s | 13.2 MiB | 00m00s [148/171] Installing debugedit-0:5.2-2. 100% | 14.0 MiB/s | 200.5 KiB | 00m00s [149/171] Installing curl-0:8.15.0-2.fc 100% | 14.1 MiB/s | 476.3 KiB | 00m00s [150/171] Installing rpm-0:5.99.91-3.fc 100% | 43.0 MiB/s | 2.5 MiB | 00m00s [151/171] Installing efi-srpm-macros-0: 100% | 40.2 MiB/s | 41.1 KiB | 00m00s [152/171] Installing java-srpm-macros-0 100% | 1.1 MiB/s | 1.1 KiB | 00m00s [153/171] Installing lua-srpm-macros-0: 100% | 1.9 MiB/s | 1.9 KiB | 00m00s [154/171] Installing tree-sitter-srpm-m 100% | 9.1 MiB/s | 9.3 KiB | 00m00s [155/171] Installing zig-srpm-macros-0: 100% | 1.6 MiB/s | 1.7 KiB | 00m00s [156/171] Installing gnupg2-dirmngr-0:2 100% | 20.9 MiB/s | 621.1 KiB | 00m00s [157/171] Installing gnupg2-0:2.4.8-4.f 100% | 163.8 MiB/s | 6.6 MiB | 00m00s [158/171] Installing rpm-sign-libs-0:5. 100% | 39.6 MiB/s | 40.6 KiB | 00m00s [159/171] Installing rpm-build-libs-0:5 100% | 131.5 MiB/s | 269.2 KiB | 00m00s [160/171] Installing gpgverify-0:2.2-3. 100% | 9.2 MiB/s | 9.4 KiB | 00m00s [161/171] Installing rpm-build-0:5.99.9 100% | 14.4 MiB/s | 294.4 KiB | 00m00s [162/171] Installing pyproject-srpm-mac 100% | 2.4 MiB/s | 2.5 KiB | 00m00s [163/171] Installing redhat-rpm-config- 100% | 30.8 MiB/s | 189.3 KiB | 00m00s [164/171] Installing forge-srpm-macros- 100% | 19.7 MiB/s | 40.3 KiB | 00m00s [165/171] Installing fonts-srpm-macros- 100% | 27.8 MiB/s | 57.0 KiB | 00m00s [166/171] Installing go-srpm-macros-0:3 100% | 61.3 MiB/s | 62.7 KiB | 00m00s [167/171] Installing python-srpm-macros 100% | 51.6 MiB/s | 52.8 KiB | 00m00s [168/171] Installing which-0:2.23-3.fc4 100% | 4.9 MiB/s | 85.7 KiB | 00m00s [169/171] Installing util-linux-0:2.41. 100% | 61.6 MiB/s | 3.6 MiB | 00m00s [170/171] Installing shadow-utils-2:4.1 100% | 81.0 MiB/s | 4.0 MiB | 00m00s [171/171] Installing info-0:7.2-6.fc43. 100% | 132.2 KiB/s | 354.3 KiB | 00m03s Warning: skipped OpenPGP checks for 3 packages from repository: copr_base Complete! Finish: installing minimal buildroot with dnf5 Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: add-determinism-0.6.0-2.fc43.x86_64 alternatives-1.33-2.fc43.x86_64 ansible-srpm-macros-1-18.1.fc43.noarch audit-libs-4.1.1-1.fc43.x86_64 bash-5.3.0-2.fc43.x86_64 binutils-2.45-1.fc43.x86_64 build-reproducibility-srpm-macros-0.6.0-2.fc43.noarch bzip2-1.0.8-21.fc43.x86_64 bzip2-libs-1.0.8-21.fc43.x86_64 ca-certificates-2024.2.69_v8.0.401-7.fc43.noarch coreutils-9.7-5.fc43.x86_64 coreutils-common-9.7-5.fc43.x86_64 cpio-2.15-6.fc43.x86_64 crypto-policies-20250714-3.gitcd6043a.fc43.noarch curl-8.15.0-2.fc43.x86_64 cyrus-sasl-lib-2.1.28-33.fc43.x86_64 debugedit-5.2-2.fc43.x86_64 diffutils-3.12-3.fc43.x86_64 dwz-0.16-2.fc43.x86_64 ed-1.22-2.fc43.x86_64 efi-srpm-macros-6-4.fc43.noarch elfutils-0.193-3.fc43.x86_64 elfutils-debuginfod-client-0.193-3.fc43.x86_64 elfutils-default-yama-scope-0.193-3.fc43.noarch elfutils-libelf-0.193-3.fc43.x86_64 elfutils-libs-0.193-3.fc43.x86_64 fedora-gpg-keys-43-0.2.noarch fedora-release-43-0.19.noarch fedora-release-common-43-0.19.noarch fedora-release-identity-basic-43-0.19.noarch fedora-repos-43-0.2.noarch fedora-repos-rawhide-43-0.2.noarch file-5.46-6.fc43.x86_64 file-libs-5.46-6.fc43.x86_64 filesystem-3.18-50.fc43.x86_64 filesystem-srpm-macros-3.18-50.fc43.noarch findutils-4.10.0-6.fc43.x86_64 fonts-srpm-macros-2.0.5-23.fc43.noarch forge-srpm-macros-0.4.0-3.fc43.noarch fpc-srpm-macros-1.3-15.fc43.noarch gawk-5.3.2-2.fc43.x86_64 gdb-minimal-16.3-4.fc43.x86_64 gdbm-libs-1.23-10.fc43.x86_64 ghc-srpm-macros-1.9.2-3.fc43.noarch glibc-2.42-2.fc43.x86_64 glibc-common-2.42-2.fc43.x86_64 glibc-gconv-extra-2.42-2.fc43.x86_64 glibc-minimal-langpack-2.42-2.fc43.x86_64 gmp-6.3.0-4.fc43.x86_64 gnat-srpm-macros-6-8.fc43.noarch gnupg2-2.4.8-4.fc43.x86_64 gnupg2-dirmngr-2.4.8-4.fc43.x86_64 gnupg2-gpg-agent-2.4.8-4.fc43.x86_64 gnupg2-gpgconf-2.4.8-4.fc43.x86_64 gnupg2-keyboxd-2.4.8-4.fc43.x86_64 gnupg2-verify-2.4.8-4.fc43.x86_64 gnutls-3.8.10-3.fc43.x86_64 go-srpm-macros-3.7.0-4.fc43.noarch gpg-pubkey-36f612dcf27f7d1a48a835e4dbfcf71c6d9f90a6-6786af3b gpg-pubkey-b0f4950458f69e1150c6c5edc8ac4916105ef944-65ca83d1 gpg-pubkey-c6e7f081cf80e13146676e88829b606631645531-66b6dccf gpgverify-2.2-3.fc43.noarch grep-3.12-2.fc43.x86_64 gzip-1.13-4.fc43.x86_64 ima-evm-utils-libs-1.6.2-6.fc43.x86_64 info-7.2-6.fc43.x86_64 jansson-2.14-3.fc43.x86_64 java-srpm-macros-1-7.fc43.noarch json-c-0.18-7.fc43.x86_64 kernel-srpm-macros-1.0-27.fc43.noarch keyutils-libs-1.6.3-6.fc43.x86_64 krb5-libs-1.21.3-7.fc43.x86_64 libacl-2.3.2-4.fc43.x86_64 libarchive-3.8.1-3.fc43.x86_64 libassuan-2.5.7-4.fc43.x86_64 libattr-2.5.2-6.fc43.x86_64 libblkid-2.41.1-15.fc43.x86_64 libbrotli-1.1.0-8.fc43.x86_64 libcap-2.76-2.fc43.x86_64 libcap-ng-0.8.5-6.fc43.x86_64 libcom_err-1.47.3-2.fc43.x86_64 libcurl-8.15.0-2.fc43.x86_64 libeconf-0.7.9-2.fc43.x86_64 libevent-2.1.12-16.fc43.x86_64 libfdisk-2.41.1-15.fc43.x86_64 libffi-3.5.1-2.fc43.x86_64 libfsverity-1.6-3.fc43.x86_64 libgcc-15.2.1-1.fc43.x86_64 libgcrypt-1.11.1-2.fc43.x86_64 libgomp-15.2.1-1.fc43.x86_64 libgpg-error-1.55-2.fc43.x86_64 libidn2-2.3.8-2.fc43.x86_64 libksba-1.6.7-4.fc43.x86_64 liblastlog2-2.41.1-15.fc43.x86_64 libmount-2.41.1-15.fc43.x86_64 libnghttp2-1.66.0-2.fc43.x86_64 libpkgconf-2.3.0-3.fc43.x86_64 libpsl-0.21.5-6.fc43.x86_64 libselinux-3.9-2.fc43.x86_64 libsemanage-3.9-2.fc43.x86_64 libsepol-3.9-2.fc43.x86_64 libsmartcols-2.41.1-15.fc43.x86_64 libssh-0.11.2-2.fc43.x86_64 libssh-config-0.11.2-2.fc43.noarch libstdc++-15.2.1-1.fc43.x86_64 libtasn1-4.20.0-2.fc43.x86_64 libtool-ltdl-2.5.4-7.fc43.x86_64 libunistring-1.1-10.fc43.x86_64 libusb1-1.0.29-2.fc43.x86_64 libuuid-2.41.1-15.fc43.x86_64 libverto-0.3.2-11.fc43.x86_64 libxcrypt-4.4.38-8.fc43.x86_64 libxml2-2.12.10-3.fc43.x86_64 libzstd-1.5.7-2.fc43.x86_64 lua-libs-5.4.8-2.fc43.x86_64 lua-srpm-macros-1-16.fc43.noarch lz4-libs-1.10.0-3.fc43.x86_64 mpfr-4.2.2-2.fc43.x86_64 ncurses-base-6.5-7.20250614.fc43.noarch ncurses-libs-6.5-7.20250614.fc43.x86_64 nettle-3.10.1-2.fc43.x86_64 npth-1.8-3.fc43.x86_64 ocaml-srpm-macros-11-2.fc43.noarch openblas-srpm-macros-2-20.fc43.noarch openldap-2.6.10-3.fc43.x86_64 openssl-libs-3.5.1-2.fc43.x86_64 p11-kit-0.25.5-9.fc43.x86_64 p11-kit-trust-0.25.5-9.fc43.x86_64 package-notes-srpm-macros-0.5-14.fc43.noarch pam-libs-1.7.1-3.fc43.x86_64 patch-2.8-2.fc43.x86_64 pcre2-10.45-1.fc43.1.x86_64 pcre2-syntax-10.45-1.fc43.1.noarch perl-srpm-macros-1-60.fc43.noarch pkgconf-2.3.0-3.fc43.x86_64 pkgconf-m4-2.3.0-3.fc43.noarch pkgconf-pkg-config-2.3.0-3.fc43.x86_64 popt-1.19-9.fc43.x86_64 publicsuffix-list-dafsa-20250616-2.fc43.noarch pyproject-srpm-macros-1.18.3-2.fc43.noarch python-srpm-macros-3.14-4.fc43.noarch qt5-srpm-macros-5.15.17-2.fc43.noarch qt6-srpm-macros-6.9.1-2.fc43.noarch readline-8.3-2.fc43.x86_64 redhat-rpm-config-343-10.fc43.noarch rpm-5.99.91-3.fc43.x86_64 rpm-build-5.99.91-3.fc43.x86_64 rpm-build-libs-5.99.91-3.fc43.x86_64 rpm-libs-5.99.91-3.fc43.x86_64 rpm-sequoia-1.9.0-2.fc43.x86_64 rpm-sign-libs-5.99.91-3.fc43.x86_64 rust-srpm-macros-26.3-5.fc43.noarch sed-4.9-5.fc43.x86_64 setup-2.15.0-26.fc43.noarch shadow-utils-4.18.0-3.fc43.x86_64 sqlite-libs-3.50.2-2.fc43.x86_64 systemd-libs-257.7-1.fc43.x86_64 systemd-standalone-sysusers-257.7-1.fc43.x86_64 tar-1.35-6.fc43.x86_64 tpm2-tss-4.1.3-8.fc43.x86_64 tree-sitter-srpm-macros-0.4.2-1.fc43.noarch unzip-6.0-67.fc43.x86_64 util-linux-2.41.1-15.fc43.x86_64 util-linux-core-2.41.1-15.fc43.x86_64 which-2.23-3.fc43.x86_64 xxhash-libs-0.8.3-3.fc43.x86_64 xz-5.8.1-2.fc43.x86_64 xz-libs-5.8.1-2.fc43.x86_64 zig-srpm-macros-1-5.fc43.noarch zip-3.0-44.fc43.x86_64 zlib-ng-compat-2.2.5-1.fc43.x86_64 zstd-1.5.7-2.fc43.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1753747200 Wrote: /builddir/build/SRPMS/rccl-6.4.2-3.fc43.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1754764510.102582/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-puw2w4my/rccl/rccl.spec) Config(child) 0 minutes 39 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/rccl-6.4.2-3.fc43.src.rpm) Config(fedora-rawhide-x86_64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1754764510.102582/root. INFO: reusing tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1754764510.102582/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1754764510.102582/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-5.99.91-3.fc43.x86_64 rpm-sequoia-1.9.0-2.fc43.x86_64 dnf5-5.2.16.0-1.fc43.x86_64 dnf5-plugins-5.2.16.0-1.fc43.x86_64 Finish: chroot init Start: build phase for rccl-6.4.2-3.fc43.src.rpm Start: build setup for rccl-6.4.2-3.fc43.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1753747200 Wrote: /builddir/build/SRPMS/rccl-6.4.2-3.fc43.src.rpm Updating and loading repositories: Copr repository 100% | 6.6 KiB/s | 1.5 KiB | 00m00s fedora 100% | 101.9 KiB/s | 26.9 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing: cmake x86_64 3.31.6-4.fc43 fedora 34.5 MiB gcc-c++ x86_64 15.2.1-1.fc43 copr_base 41.4 MiB hipify x86_64 6.4.1-3.fc43 copr_base 3.1 MiB rocm-cmake noarch 6.4.0-2.fc43 copr_base 130.5 KiB rocm-comgr-devel x86_64 19-15.rocm6.4.2.fc43 copr_base 98.2 KiB rocm-core-devel x86_64 6.4.3-1.fc43 copr_base 14.8 KiB rocm-hip-devel x86_64 6.4.2-2.fc43 copr_base 2.8 MiB rocm-rpm-macros noarch 6.4.2-1.fc43 fedora 18.9 KiB rocm-runtime-devel x86_64 6.4.2-2.fc43 copr_base 571.4 KiB rocm-smi-devel x86_64 6.4.3-1.fc43 copr_base 281.8 KiB Installing dependencies: annobin-docs noarch 12.99-1.fc43 fedora 98.9 KiB annobin-plugin-gcc x86_64 12.99-1.fc43 fedora 1.0 MiB cmake-data noarch 3.31.6-4.fc43 fedora 8.5 MiB cmake-filesystem x86_64 3.31.6-4.fc43 fedora 0.0 B cmake-rpm-macros noarch 3.31.6-4.fc43 fedora 7.7 KiB cpp x86_64 15.2.1-1.fc43 copr_base 37.9 MiB emacs-filesystem noarch 1:30.0-5.fc43 fedora 0.0 B environment-modules x86_64 5.6.0-1.fc43 fedora 1.9 MiB expat x86_64 2.7.1-3.fc43 fedora 294.2 KiB gcc x86_64 15.2.1-1.fc43 copr_base 111.9 MiB gcc-plugin-annobin x86_64 15.2.1-1.fc43 copr_base 57.2 KiB git x86_64 2.50.1-2.fc43 fedora 85.1 KiB git-core x86_64 2.50.1-2.fc43 fedora 23.5 MiB git-core-doc noarch 2.50.1-2.fc43 fedora 17.7 MiB glibc-devel x86_64 2.42-2.fc43 fedora 2.3 MiB groff-base x86_64 1.23.0-9.fc43 fedora 3.8 MiB hipcc x86_64 19-15.rocm6.4.2.fc43 copr_base 653.0 KiB hwdata noarch 0.398-1.fc43 fedora 9.6 MiB jsoncpp x86_64 1.9.6-2.fc43 fedora 257.6 KiB kernel-headers x86_64 6.16.0-65.fc43 fedora 6.7 MiB less x86_64 679-2.fc43 fedora 406.1 KiB libcbor x86_64 0.12.0-6.fc43 fedora 77.8 KiB libdb x86_64 5.3.28-66.fc43 fedora 1.9 MiB libdrm x86_64 2.4.125-2.fc43 fedora 395.8 KiB libdrm-devel x86_64 2.4.125-2.fc43 fedora 728.8 KiB libedit x86_64 3.1-56.20250104cvs.fc43 fedora 240.1 KiB libfido2 x86_64 1.16.0-3.fc43 fedora 238.5 KiB libmpc x86_64 1.3.1-8.fc43 fedora 160.6 KiB libpciaccess x86_64 0.16-16.fc43 fedora 44.5 KiB libpciaccess-devel x86_64 0.16-16.fc43 fedora 15.3 KiB libpipeline x86_64 1.5.8-3.fc43 fedora 145.1 KiB libstdc++-devel x86_64 15.2.1-1.fc43 copr_base 37.3 MiB libtommath x86_64 1.3.1~rc1-6.fc43 fedora 126.4 KiB libuv x86_64 1:1.51.0-2.fc43 fedora 570.2 KiB libxcrypt-devel x86_64 4.4.38-8.fc43 fedora 30.8 KiB make x86_64 1:4.4.1-11.fc43 fedora 1.8 MiB man-db x86_64 2.13.1-2.fc43 fedora 2.9 MiB mpdecimal x86_64 4.0.1-2.fc43 fedora 217.2 KiB ncurses x86_64 6.5-7.20250614.fc43 fedora 609.8 KiB numactl-libs x86_64 2.0.19-3.fc43 fedora 56.9 KiB openssh x86_64 10.0p1-5.fc43 fedora 1.4 MiB openssh-clients x86_64 10.0p1-5.fc43 fedora 2.6 MiB perl x86_64 4:5.42.0-520.fc43 fedora 0.0 B perl-Algorithm-Diff noarch 1.2010-14.fc43 fedora 107.5 KiB perl-Archive-Tar noarch 3.04-521.fc43 fedora 154.4 KiB perl-Archive-Zip noarch 1.68-17.fc43 fedora 291.1 KiB perl-Attribute-Handlers noarch 1.03-520.fc43 fedora 39.9 KiB perl-AutoLoader noarch 5.74-520.fc43 fedora 20.6 KiB perl-AutoSplit noarch 5.74-520.fc43 fedora 23.1 KiB perl-B x86_64 1.89-520.fc43 fedora 501.3 KiB perl-Benchmark noarch 1.27-520.fc43 fedora 36.4 KiB perl-CPAN noarch 2.38-521.fc43 fedora 1.9 MiB perl-CPAN-Meta noarch 2.150010-520.fc43 fedora 592.2 KiB perl-CPAN-Meta-Requirements noarch 2.143-13.fc43 fedora 81.2 KiB perl-CPAN-Meta-YAML noarch 0.020-521.fc43 fedora 52.1 KiB perl-Carp noarch 1.54-520.fc43 fedora 46.6 KiB perl-Class-Struct noarch 0.68-520.fc43 fedora 25.4 KiB perl-Compress-Bzip2 x86_64 2.28-24.fc43 fedora 142.6 KiB perl-Compress-Raw-Bzip2 x86_64 2.213-521.fc43 fedora 67.3 KiB perl-Compress-Raw-Lzma x86_64 2.213-7.fc43 fedora 120.9 KiB perl-Compress-Raw-Zlib x86_64 2.213-521.fc43 fedora 163.2 KiB perl-Config-Extensions noarch 0.03-520.fc43 fedora 2.7 KiB perl-Config-Perl-V noarch 0.38-521.fc43 fedora 25.9 KiB perl-DBM_Filter noarch 0.07-520.fc43 fedora 28.7 KiB perl-DB_File x86_64 1.859-516.fc43 fedora 188.8 KiB perl-Data-Dumper x86_64 2.191-521.fc43 fedora 115.6 KiB perl-Data-OptList noarch 0.114-7.fc43 fedora 50.1 KiB perl-Data-Section noarch 0.200008-8.fc43 fedora 42.7 KiB perl-Devel-PPPort x86_64 3.73-521.fc43 fedora 889.8 KiB perl-Devel-Peek x86_64 1.36-520.fc43 fedora 43.5 KiB perl-Devel-SelfStubber noarch 1.06-520.fc43 fedora 6.8 KiB perl-Devel-Size x86_64 0.85-3.fc43 fedora 42.1 KiB perl-Digest noarch 1.20-520.fc43 fedora 35.3 KiB perl-Digest-MD5 x86_64 2.59-520.fc43 fedora 59.7 KiB perl-Digest-SHA x86_64 1:6.04-521.fc43 fedora 112.5 KiB perl-DirHandle noarch 1.05-520.fc43 fedora 3.4 KiB perl-Dumpvalue noarch 2.27-520.fc43 fedora 19.8 KiB perl-DynaLoader x86_64 1.57-520.fc43 fedora 32.1 KiB perl-Encode x86_64 4:3.21-520.fc43 fedora 4.7 MiB perl-Encode-devel x86_64 4:3.21-520.fc43 fedora 99.6 KiB perl-English noarch 1.11-520.fc43 fedora 6.2 KiB perl-Env noarch 1.06-520.fc43 fedora 26.1 KiB perl-Errno x86_64 1.38-520.fc43 fedora 8.4 KiB perl-Error noarch 1:0.17030-2.fc43 fedora 76.7 KiB perl-Exporter noarch 5.79-520.fc43 fedora 54.3 KiB perl-ExtUtils-CBuilder noarch 1:0.280242-520.fc43 fedora 97.3 KiB perl-ExtUtils-Command noarch 2:7.76-521.fc43 fedora 9.6 KiB perl-ExtUtils-Constant noarch 0.25-520.fc43 fedora 85.9 KiB perl-ExtUtils-Embed noarch 1.35-520.fc43 fedora 15.6 KiB perl-ExtUtils-Install noarch 2.22-520.fc43 fedora 85.5 KiB perl-ExtUtils-MM-Utils noarch 2:7.76-521.fc43 fedora 2.9 KiB perl-ExtUtils-MakeMaker noarch 2:7.76-521.fc43 fedora 739.7 KiB perl-ExtUtils-Manifest noarch 1:1.75-520.fc43 fedora 84.8 KiB perl-ExtUtils-Miniperl noarch 1.14-520.fc43 fedora 8.3 KiB perl-ExtUtils-ParseXS noarch 1:3.58-2.fc43 fedora 539.6 KiB perl-Fcntl x86_64 1.20-520.fc43 fedora 48.8 KiB perl-File-Basename noarch 2.86-520.fc43 fedora 14.0 KiB perl-File-Compare noarch 1.100.800-520.fc43 fedora 5.6 KiB perl-File-Copy noarch 2.41-520.fc43 fedora 19.7 KiB perl-File-DosGlob x86_64 1.12-520.fc43 fedora 20.8 KiB perl-File-Fetch noarch 1.08-3.fc43 fedora 60.3 KiB perl-File-Find noarch 1.44-520.fc43 fedora 42.0 KiB perl-File-HomeDir noarch 1.006-15.fc43 fedora 119.3 KiB perl-File-Path noarch 2.18-520.fc43 fedora 63.5 KiB perl-File-Temp noarch 1:0.231.100-520.fc43 fedora 162.3 KiB perl-File-Which noarch 1.27-14.fc43 fedora 30.4 KiB perl-File-stat noarch 1.14-520.fc43 fedora 12.5 KiB perl-FileCache noarch 1.10-520.fc43 fedora 7.5 KiB perl-FileHandle noarch 2.05-520.fc43 fedora 9.4 KiB perl-Filter x86_64 2:1.64-521.fc43 fedora 156.7 KiB perl-Filter-Simple noarch 0.96-520.fc43 fedora 50.7 KiB perl-FindBin noarch 1.54-520.fc43 fedora 6.8 KiB perl-GDBM_File x86_64 1:1.24-520.fc43 fedora 79.6 KiB perl-Getopt-Long noarch 1:2.58-520.fc43 fedora 144.5 KiB perl-Getopt-Std noarch 1.14-520.fc43 fedora 11.2 KiB perl-Git noarch 2.50.1-2.fc43 fedora 64.0 KiB perl-HTTP-Tiny noarch 0.090-521.fc43 fedora 154.4 KiB perl-Hash-Util x86_64 0.32-520.fc43 fedora 55.0 KiB perl-Hash-Util-FieldHash x86_64 1.27-520.fc43 fedora 62.6 KiB perl-I18N-Collate noarch 1.02-520.fc43 fedora 7.1 KiB perl-I18N-LangTags noarch 0.45-520.fc43 fedora 82.4 KiB perl-I18N-Langinfo x86_64 0.24-520.fc43 fedora 34.7 KiB perl-IO x86_64 1.55-520.fc43 fedora 147.4 KiB perl-IO-Compress noarch 2.213-521.fc43 fedora 1.0 MiB perl-IO-Compress-Lzma noarch 2.213-3.fc43 fedora 215.2 KiB perl-IO-Socket-IP noarch 0.43-521.fc43 fedora 100.3 KiB perl-IO-Socket-SSL noarch 2.095-2.fc43 fedora 714.5 KiB perl-IO-Zlib noarch 1:1.15-520.fc43 fedora 25.7 KiB perl-IPC-Cmd noarch 2:1.04-521.fc43 fedora 84.9 KiB perl-IPC-Open3 noarch 1.24-520.fc43 fedora 27.7 KiB perl-IPC-SysV x86_64 2.09-521.fc43 fedora 73.7 KiB perl-IPC-System-Simple noarch 1.30-16.fc43 fedora 71.7 KiB perl-JSON-PP noarch 1:4.16-521.fc43 fedora 141.8 KiB perl-Locale-Maketext noarch 1.33-521.fc43 fedora 171.3 KiB perl-Locale-Maketext-Simple noarch 1:0.21-520.fc43 fedora 12.8 KiB perl-MIME-Base32 noarch 1.303-24.fc43 fedora 30.7 KiB perl-MIME-Base64 x86_64 3.16-520.fc43 fedora 42.0 KiB perl-MRO-Compat noarch 0.15-12.fc43 fedora 43.0 KiB perl-Math-BigInt noarch 1:2.0050.03-3.fc43 fedora 1.1 MiB perl-Math-BigInt-FastCalc x86_64 0.502.000-520.fc43 fedora 44.0 KiB perl-Math-Complex noarch 1.63-520.fc43 fedora 85.1 KiB perl-Memoize noarch 1.17-520.fc43 fedora 64.7 KiB perl-Module-Build noarch 2:0.42.34-9.fc43 fedora 654.2 KiB perl-Module-CoreList noarch 1:5.20250803-1.fc43 fedora 1.2 MiB perl-Module-CoreList-tools noarch 1:5.20250803-1.fc43 fedora 18.6 KiB perl-Module-Load noarch 1:0.36-520.fc43 fedora 14.9 KiB perl-Module-Load-Conditional noarch 0.74-520.fc43 fedora 28.7 KiB perl-Module-Loaded noarch 1:0.08-520.fc43 fedora 5.0 KiB perl-Module-Metadata noarch 1.000038-520.fc43 fedora 67.5 KiB perl-Module-Signature noarch 0.93-2.fc43 fedora 136.5 KiB perl-NDBM_File x86_64 1.18-520.fc43 fedora 28.5 KiB perl-NEXT noarch 0.69-520.fc43 fedora 23.6 KiB perl-Net noarch 1.04-520.fc43 fedora 22.4 KiB perl-Net-Ping noarch 2.76-520.fc43 fedora 134.2 KiB perl-Net-SSLeay x86_64 1.94-11.fc43 fedora 1.3 MiB perl-ODBM_File x86_64 1.20-520.fc43 fedora 28.5 KiB perl-Opcode x86_64 1.69-520.fc43 fedora 48.6 KiB perl-POSIX x86_64 2.23-520.fc43 fedora 231.4 KiB perl-Package-Generator noarch 1.106-34.fc43 fedora 29.9 KiB perl-Params-Check noarch 1:0.38-520.fc43 fedora 27.6 KiB perl-Params-Util x86_64 1.102-19.fc43 fedora 58.5 KiB perl-PathTools x86_64 3.94-520.fc43 fedora 180.0 KiB perl-Perl-OSType noarch 1.010-521.fc43 fedora 32.8 KiB perl-PerlIO-via-QuotedPrint noarch 0.10-520.fc43 fedora 30.2 KiB perl-Pod-Checker noarch 4:1.77-520.fc43 fedora 52.2 KiB perl-Pod-Escapes noarch 1:1.07-520.fc43 fedora 24.9 KiB perl-Pod-Functions noarch 1.14-520.fc43 fedora 14.4 KiB perl-Pod-Html noarch 1.35-520.fc43 fedora 42.3 KiB perl-Pod-Perldoc noarch 3.28.01-521.fc43 fedora 163.7 KiB perl-Pod-Simple noarch 1:3.47-3.fc43 fedora 565.3 KiB perl-Pod-Usage noarch 4:2.05-520.fc43 fedora 86.3 KiB perl-Safe noarch 2.47-520.fc43 fedora 30.7 KiB perl-Scalar-List-Utils x86_64 5:1.70-1.fc43 fedora 144.9 KiB perl-Search-Dict noarch 1.08-520.fc43 fedora 4.7 KiB perl-SelectSaver noarch 1.02-520.fc43 fedora 2.2 KiB perl-SelfLoader noarch 1.28-520.fc43 fedora 22.2 KiB perl-Socket x86_64 4:2.040-2.fc43 fedora 120.3 KiB perl-Software-License noarch 0.104007-2.fc43 fedora 500.7 KiB perl-Storable x86_64 1:3.37-521.fc43 fedora 231.2 KiB perl-Sub-Exporter noarch 0.991-6.fc43 fedora 194.9 KiB perl-Sub-Install noarch 0.929-8.fc43 fedora 35.9 KiB perl-Symbol noarch 1.09-520.fc43 fedora 6.8 KiB perl-Sys-Hostname x86_64 1.25-520.fc43 fedora 15.8 KiB perl-Sys-Syslog x86_64 0.36-521.fc43 fedora 94.7 KiB perl-Term-ANSIColor noarch 5.01-521.fc43 fedora 97.5 KiB perl-Term-Cap noarch 1.18-520.fc43 fedora 29.3 KiB perl-Term-Complete noarch 1.403-520.fc43 fedora 5.8 KiB perl-Term-ReadLine noarch 1.17-520.fc43 fedora 17.3 KiB perl-Term-Table noarch 0.024-520.fc43 fedora 77.9 KiB perl-TermReadKey x86_64 2.38-26.fc43 fedora 64.0 KiB perl-Test noarch 1.31-520.fc43 fedora 37.0 KiB perl-Test-Harness noarch 1:3.52-4.fc43 fedora 560.8 KiB perl-Test-Simple noarch 3:1.302214-4.fc43 fedora 1.7 MiB perl-Text-Abbrev noarch 1.02-520.fc43 fedora 3.1 KiB perl-Text-Balanced noarch 2.07-1.fc43 fedora 111.5 KiB perl-Text-Diff noarch 1.45-24.fc43 fedora 83.0 KiB perl-Text-Glob noarch 0.11-26.fc43 fedora 8.4 KiB perl-Text-ParseWords noarch 3.31-520.fc43 fedora 13.6 KiB perl-Text-Tabs+Wrap noarch 2024.001-520.fc43 fedora 22.6 KiB perl-Text-Template noarch 1.61-8.fc43 fedora 112.4 KiB perl-Thread noarch 3.06-520.fc43 fedora 12.1 KiB perl-Thread-Queue noarch 3.14-520.fc43 fedora 28.9 KiB perl-Thread-Semaphore noarch 2.13-520.fc43 fedora 10.0 KiB perl-Tie noarch 4.6-520.fc43 fedora 32.1 KiB perl-Tie-File noarch 1.10-520.fc43 fedora 85.6 KiB perl-Tie-Memoize noarch 1.1-520.fc43 fedora 6.2 KiB perl-Tie-RefHash noarch 1.41-520.fc43 fedora 35.9 KiB perl-Time noarch 1.04-520.fc43 fedora 9.8 KiB perl-Time-HiRes x86_64 4:1.9778-520.fc43 fedora 115.8 KiB perl-Time-Local noarch 2:1.350-520.fc43 fedora 69.0 KiB perl-Time-Piece x86_64 1.3600-520.fc43 fedora 71.2 KiB perl-URI noarch 5.32-2.fc43 fedora 261.2 KiB perl-Unicode-Collate x86_64 1.31-520.fc43 fedora 4.2 MiB perl-Unicode-Normalize x86_64 1.32-520.fc43 fedora 486.2 KiB perl-Unicode-UCD noarch 0.81-520.fc43 fedora 206.4 KiB perl-User-pwent noarch 1.05-520.fc43 fedora 17.1 KiB perl-autodie noarch 2.37-521.fc43 fedora 214.9 KiB perl-autouse noarch 1.11-520.fc43 fedora 5.9 KiB perl-base noarch 2.27-520.fc43 fedora 12.6 KiB perl-bignum noarch 0.67-521.fc43 fedora 133.1 KiB perl-blib noarch 1.07-520.fc43 fedora 3.2 KiB perl-constant noarch 1.33-521.fc43 fedora 26.2 KiB perl-debugger noarch 1.60-520.fc43 fedora 403.2 KiB perl-deprecate noarch 0.04-520.fc43 fedora 6.6 KiB perl-devel x86_64 4:5.42.0-520.fc43 fedora 3.8 MiB perl-diagnostics noarch 1.40-520.fc43 fedora 471.0 KiB perl-doc noarch 5.42.0-520.fc43 fedora 11.5 MiB perl-encoding x86_64 4:3.00-520.fc43 fedora 149.5 KiB perl-encoding-warnings noarch 0.14-520.fc43 fedora 10.1 KiB perl-experimental noarch 0.036-2.fc43 fedora 43.4 KiB perl-fields noarch 2.27-520.fc43 fedora 11.9 KiB perl-filetest noarch 1.03-520.fc43 fedora 6.4 KiB perl-if noarch 0.61.000-520.fc43 fedora 5.8 KiB perl-inc-latest noarch 2:0.500-31.fc43 fedora 34.6 KiB perl-interpreter x86_64 4:5.42.0-520.fc43 fedora 118.6 KiB perl-less noarch 0.03-520.fc43 fedora 4.9 KiB perl-lib x86_64 0.65-520.fc43 fedora 8.5 KiB perl-libnet noarch 3.15-521.fc43 fedora 289.4 KiB perl-libnetcfg noarch 4:5.42.0-520.fc43 fedora 16.9 KiB perl-libs x86_64 4:5.42.0-520.fc43 fedora 11.5 MiB perl-local-lib noarch 2.000029-10.fc43 fedora 117.6 KiB perl-locale noarch 1.13-520.fc43 fedora 6.1 KiB perl-macros noarch 4:5.42.0-520.fc43 fedora 5.5 KiB perl-meta-notation noarch 5.42.0-520.fc43 fedora 2.0 KiB perl-mro x86_64 1.29-520.fc43 fedora 41.6 KiB perl-open noarch 1.13-520.fc43 fedora 11.3 KiB perl-overload noarch 1.40-520.fc43 fedora 71.6 KiB perl-overloading noarch 0.02-520.fc43 fedora 4.9 KiB perl-parent noarch 1:0.244-520.fc43 fedora 10.3 KiB perl-perlfaq noarch 5.20250619-520.fc43 fedora 733.6 KiB perl-ph x86_64 5.42.0-520.fc43 fedora 276.4 KiB perl-podlators noarch 1:6.0.2-520.fc43 fedora 317.5 KiB perl-sigtrap noarch 1.10-520.fc43 fedora 11.1 KiB perl-sort noarch 2.06-520.fc43 fedora 4.8 KiB perl-subs noarch 1.04-520.fc43 fedora 2.1 KiB perl-threads x86_64 1:2.43-520.fc43 fedora 115.1 KiB perl-threads-shared x86_64 1.70-520.fc43 fedora 83.6 KiB perl-utils noarch 5.42.0-520.fc43 fedora 97.0 KiB perl-vars noarch 1.05-520.fc43 fedora 3.9 KiB perl-version x86_64 9:0.99.33-521.fc43 fedora 128.7 KiB perl-vmsish noarch 1.04-520.fc43 fedora 6.6 KiB procps-ng x86_64 4.0.4-7.fc43 fedora 1.0 MiB python-pip-wheel noarch 25.1.1-14.fc43 fedora 1.2 MiB python3 x86_64 3.14.0~rc1-2.fc43 fedora 28.9 KiB python3-libs x86_64 3.14.0~rc1-2.fc43 fedora 42.9 MiB python3-pyparsing noarch 3.1.2-12.fc43 fedora 1.0 MiB rhash x86_64 1.4.5-3.fc43 fedora 351.1 KiB rocm-clang x86_64 19-15.rocm6.4.2.fc43 copr_base 70.2 MiB rocm-clang-devel x86_64 19-15.rocm6.4.2.fc43 copr_base 23.3 MiB rocm-clang-libs x86_64 19-15.rocm6.4.2.fc43 copr_base 98.4 MiB rocm-clang-runtime-devel x86_64 19-15.rocm6.4.2.fc43 copr_base 7.8 MiB rocm-comgr x86_64 19-15.rocm6.4.2.fc43 copr_base 123.9 MiB rocm-core x86_64 6.4.3-1.fc43 copr_base 12.3 KiB rocm-device-libs x86_64 19-15.rocm6.4.2.fc43 copr_base 3.2 MiB rocm-hip x86_64 6.4.2-2.fc43 copr_base 24.9 MiB rocm-libc++ x86_64 19-15.rocm6.4.2.fc43 copr_base 1.2 MiB rocm-libc++-devel x86_64 19-15.rocm6.4.2.fc43 copr_base 7.5 MiB rocm-lld x86_64 19-15.rocm6.4.2.fc43 copr_base 5.7 MiB rocm-llvm x86_64 19-15.rocm6.4.2.fc43 copr_base 48.5 MiB rocm-llvm-devel x86_64 19-15.rocm6.4.2.fc43 copr_base 25.3 MiB rocm-llvm-filesystem x86_64 19-15.rocm6.4.2.fc43 copr_base 0.0 B rocm-llvm-libs x86_64 19-15.rocm6.4.2.fc43 copr_base 84.8 MiB rocm-llvm-static x86_64 19-15.rocm6.4.2.fc43 copr_base 1.8 GiB rocm-runtime x86_64 6.4.2-2.fc43 copr_base 3.1 MiB rocm-smi x86_64 6.4.3-1.fc43 copr_base 2.7 MiB systemtap-sdt-devel x86_64 5.3-3.fc43 fedora 182.9 KiB systemtap-sdt-dtrace x86_64 5.3-3.fc43 fedora 179.6 KiB tcl x86_64 1:9.0.2-1.fc43 fedora 4.3 MiB tzdata noarch 2025b-3.fc43 fedora 1.6 MiB vim-filesystem noarch 2:9.1.1591-1.fc43 fedora 40.0 B zlib-ng-compat-devel x86_64 2.2.5-1.fc43 fedora 107.0 KiB Transaction Summary: Installing: 301 packages Total size of inbound packages is 529 MiB. Need to download 529 MiB. After this operation, 3 GiB extra will be used (install 3 GiB, remove 0 B). [ 1/301] hipify-0:6.4.1-3.fc43.x86_64 100% | 1.6 MiB/s | 505.6 KiB | 00m00s [ 2/301] rocm-cmake-0:6.4.0-2.fc43.noa 100% | 91.7 KiB/s | 37.7 KiB | 00m00s [ 3/301] cmake-0:3.31.6-4.fc43.x86_64 100% | 15.9 MiB/s | 12.2 MiB | 00m01s [ 4/301] rocm-comgr-devel-0:19-15.rocm 100% | 96.5 KiB/s | 32.4 KiB | 00m00s [ 5/301] rocm-core-devel-0:6.4.3-1.fc4 100% | 34.0 KiB/s | 13.7 KiB | 00m00s [ 6/301] rocm-rpm-macros-0:6.4.2-1.fc4 100% | 303.6 KiB/s | 16.1 KiB | 00m00s [ 7/301] rocm-hip-devel-0:6.4.2-2.fc43 100% | 403.2 KiB/s | 233.9 KiB | 00m01s [ 8/301] rocm-runtime-devel-0:6.4.2-2. 100% | 156.3 KiB/s | 93.9 KiB | 00m01s [ 9/301] cmake-data-0:3.31.6-4.fc43.no 100% | 18.0 MiB/s | 2.5 MiB | 00m00s [ 10/301] cmake-filesystem-0:3.31.6-4.f 100% | 595.7 KiB/s | 15.5 KiB | 00m00s [ 11/301] expat-0:2.7.1-3.fc43.x86_64 100% | 3.5 MiB/s | 116.1 KiB | 00m00s [ 12/301] jsoncpp-0:1.9.6-2.fc43.x86_64 100% | 1.3 MiB/s | 101.1 KiB | 00m00s [ 13/301] rocm-smi-devel-0:6.4.3-1.fc43 100% | 115.7 KiB/s | 57.5 KiB | 00m00s [ 14/301] libuv-1:1.51.0-2.fc43.x86_64 100% | 3.0 MiB/s | 266.1 KiB | 00m00s [ 15/301] rhash-0:1.4.5-3.fc43.x86_64 100% | 5.7 MiB/s | 197.9 KiB | 00m00s [ 16/301] gcc-c++-0:15.2.1-1.fc43.x86_6 100% | 6.7 MiB/s | 15.3 MiB | 00m02s [ 17/301] make-1:4.4.1-11.fc43.x86_64 100% | 3.6 MiB/s | 585.2 KiB | 00m00s [ 18/301] libmpc-0:1.3.1-8.fc43.x86_64 100% | 1.9 MiB/s | 70.4 KiB | 00m00s [ 19/301] perl-4:5.42.0-520.fc43.x86_64 100% | 219.6 KiB/s | 13.6 KiB | 00m00s [ 20/301] perl-interpreter-4:5.42.0-520 100% | 1.8 MiB/s | 72.4 KiB | 00m00s [ 21/301] libstdc++-devel-0:15.2.1-1.fc 100% | 5.9 MiB/s | 5.2 MiB | 00m01s [ 22/301] perl-File-Basename-0:2.86-520 100% | 536.5 KiB/s | 17.2 KiB | 00m00s [ 23/301] perl-File-Copy-0:2.41-520.fc4 100% | 745.5 KiB/s | 20.1 KiB | 00m00s [ 24/301] perl-File-Which-0:1.27-14.fc4 100% | 648.3 KiB/s | 21.4 KiB | 00m00s [ 25/301] perl-Getopt-Std-0:1.14-520.fc 100% | 349.0 KiB/s | 15.7 KiB | 00m00s [ 26/301] perl-PathTools-0:3.94-520.fc4 100% | 2.9 MiB/s | 87.2 KiB | 00m00s [ 27/301] perl-Scalar-List-Utils-5:1.70 100% | 543.1 KiB/s | 75.0 KiB | 00m00s [ 28/301] perl-URI-0:5.32-2.fc43.noarch 100% | 5.2 MiB/s | 143.6 KiB | 00m00s [ 29/301] environment-modules-0:5.6.0-1 100% | 6.1 MiB/s | 795.3 KiB | 00m00s [ 30/301] emacs-filesystem-1:30.0-5.fc4 100% | 220.2 KiB/s | 7.5 KiB | 00m00s [ 31/301] rocm-comgr-0:19-15.rocm6.4.2. 100% | 13.2 MiB/s | 30.5 MiB | 00m02s [ 32/301] gcc-0:15.2.1-1.fc43.x86_64 100% | 15.2 MiB/s | 39.6 MiB | 00m03s [ 33/301] cpp-0:15.2.1-1.fc43.x86_64 100% | 28.0 MiB/s | 12.9 MiB | 00m00s [ 34/301] vim-filesystem-2:9.1.1591-1.f 100% | 6.7 KiB/s | 15.3 KiB | 00m02s [ 35/301] perl-Archive-Tar-0:3.04-521.f 100% | 61.2 KiB/s | 70.9 KiB | 00m01s [ 36/301] perl-Attribute-Handlers-0:1.0 100% | 36.0 KiB/s | 28.1 KiB | 00m01s [ 37/301] perl-AutoLoader-0:5.74-520.fc 100% | 198.6 KiB/s | 21.2 KiB | 00m00s [ 38/301] perl-B-0:1.89-520.fc43.x86_64 100% | 2.5 MiB/s | 177.7 KiB | 00m00s [ 39/301] perl-AutoSplit-0:5.74-520.fc4 100% | 227.6 KiB/s | 21.6 KiB | 00m00s [ 40/301] perl-Benchmark-0:1.27-520.fc4 100% | 228.9 KiB/s | 26.8 KiB | 00m00s [ 41/301] perl-CPAN-Meta-0:2.150010-520 100% | 2.0 MiB/s | 190.8 KiB | 00m00s [ 42/301] perl-CPAN-0:2.38-521.fc43.noa 100% | 4.6 MiB/s | 567.4 KiB | 00m00s [ 43/301] perl-CPAN-Meta-Requirements-0 100% | 783.2 KiB/s | 34.5 KiB | 00m00s [ 44/301] perl-Carp-0:1.54-520.fc43.noa 100% | 1.3 MiB/s | 28.7 KiB | 00m00s [ 45/301] perl-CPAN-Meta-YAML-0:0.020-5 100% | 720.7 KiB/s | 26.7 KiB | 00m00s [ 46/301] perl-Compress-Raw-Bzip2-0:2.2 100% | 2.2 MiB/s | 35.9 KiB | 00m00s [ 47/301] perl-Class-Struct-0:0.68-520. 100% | 919.8 KiB/s | 22.1 KiB | 00m00s [ 48/301] perl-Compress-Raw-Zlib-0:2.21 100% | 3.0 MiB/s | 65.0 KiB | 00m00s [ 49/301] perl-Config-Extensions-0:0.03 100% | 408.7 KiB/s | 12.3 KiB | 00m00s [ 50/301] perl-Config-Perl-V-0:0.38-521 100% | 652.5 KiB/s | 21.5 KiB | 00m00s [ 51/301] perl-DBM_Filter-0:0.07-520.fc 100% | 736.7 KiB/s | 27.3 KiB | 00m00s [ 52/301] perl-Data-Dumper-0:2.191-521. 100% | 1.7 MiB/s | 56.3 KiB | 00m00s [ 53/301] perl-Devel-PPPort-0:3.73-521. 100% | 3.6 MiB/s | 219.9 KiB | 00m00s [ 54/301] perl-Devel-Peek-0:1.36-520.fc 100% | 725.5 KiB/s | 31.9 KiB | 00m00s [ 55/301] perl-DB_File-0:1.859-516.fc43 100% | 928.7 KiB/s | 80.8 KiB | 00m00s [ 56/301] perl-Devel-SelfStubber-0:1.06 100% | 387.2 KiB/s | 14.3 KiB | 00m00s [ 57/301] perl-Digest-0:1.20-520.fc43.n 100% | 527.4 KiB/s | 24.8 KiB | 00m00s [ 58/301] perl-Digest-SHA-1:6.04-521.fc 100% | 2.7 MiB/s | 61.8 KiB | 00m00s [ 59/301] perl-DirHandle-0:1.05-520.fc4 100% | 346.0 KiB/s | 12.5 KiB | 00m00s [ 60/301] perl-Dumpvalue-0:2.27-520.fc4 100% | 407.2 KiB/s | 18.3 KiB | 00m00s [ 61/301] perl-DynaLoader-0:1.57-520.fc 100% | 1.2 MiB/s | 26.0 KiB | 00m00s [ 62/301] perl-Digest-MD5-0:2.59-520.fc 100% | 296.1 KiB/s | 35.8 KiB | 00m00s [ 63/301] perl-English-0:1.11-520.fc43. 100% | 544.0 KiB/s | 13.6 KiB | 00m00s [ 64/301] perl-Env-0:1.06-520.fc43.noar 100% | 472.6 KiB/s | 19.4 KiB | 00m00s [ 65/301] perl-Exporter-0:5.79-520.fc43 100% | 1.3 MiB/s | 30.9 KiB | 00m00s [ 66/301] perl-ExtUtils-CBuilder-1:0.28 100% | 1.3 MiB/s | 50.6 KiB | 00m00s [ 67/301] perl-ExtUtils-Command-2:7.76- 100% | 290.7 KiB/s | 14.0 KiB | 00m00s [ 68/301] perl-ExtUtils-Constant-0:0.25 100% | 2.1 MiB/s | 43.8 KiB | 00m00s [ 69/301] perl-ExtUtils-Embed-0:1.35-52 100% | 707.3 KiB/s | 17.7 KiB | 00m00s [ 70/301] perl-ExtUtils-MM-Utils-2:7.76 100% | 640.1 KiB/s | 11.5 KiB | 00m00s [ 71/301] perl-ExtUtils-Install-0:2.22- 100% | 1.0 MiB/s | 43.4 KiB | 00m00s [ 72/301] perl-ExtUtils-MakeMaker-2:7.7 100% | 5.0 MiB/s | 294.6 KiB | 00m00s [ 73/301] perl-ExtUtils-Miniperl-0:1.14 100% | 939.5 KiB/s | 15.0 KiB | 00m00s [ 74/301] perl-ExtUtils-ParseXS-1:3.58- 100% | 6.2 MiB/s | 220.7 KiB | 00m00s [ 75/301] perl-Fcntl-0:1.20-520.fc43.x8 100% | 1.5 MiB/s | 29.8 KiB | 00m00s [ 76/301] perl-Errno-0:1.38-520.fc43.x8 100% | 57.9 KiB/s | 14.9 KiB | 00m00s [ 77/301] perl-File-Compare-0:1.100.800 100% | 631.2 KiB/s | 13.3 KiB | 00m00s [ 78/301] perl-File-Fetch-0:1.08-3.fc43 100% | 1.3 MiB/s | 30.7 KiB | 00m00s [ 79/301] perl-File-Find-0:1.44-520.fc4 100% | 1.5 MiB/s | 25.3 KiB | 00m00s [ 80/301] perl-File-DosGlob-0:1.12-520. 100% | 326.5 KiB/s | 19.6 KiB | 00m00s [ 81/301] perl-File-Path-0:2.18-520.fc4 100% | 1.9 MiB/s | 35.1 KiB | 00m00s [ 82/301] perl-File-stat-0:1.14-520.fc4 100% | 741.9 KiB/s | 17.1 KiB | 00m00s [ 83/301] perl-ExtUtils-Manifest-1:1.75 100% | 125.5 KiB/s | 34.0 KiB | 00m00s [ 84/301] perl-FileCache-0:1.10-520.fc4 100% | 342.5 KiB/s | 14.7 KiB | 00m00s [ 85/301] perl-FileHandle-0:2.05-520.fc 100% | 1.0 MiB/s | 15.5 KiB | 00m00s [ 86/301] perl-Filter-Simple-0:0.96-520 100% | 896.8 KiB/s | 26.9 KiB | 00m00s [ 87/301] perl-Filter-2:1.64-521.fc43.x 100% | 1.3 MiB/s | 85.9 KiB | 00m00s [ 88/301] perl-FindBin-0:1.54-520.fc43. 100% | 527.1 KiB/s | 14.2 KiB | 00m00s [ 89/301] perl-File-Temp-1:0.231.100-52 100% | 373.6 KiB/s | 59.0 KiB | 00m00s [ 90/301] perl-Getopt-Long-1:2.58-520.f 100% | 3.0 MiB/s | 63.6 KiB | 00m00s [ 91/301] perl-Hash-Util-0:0.32-520.fc4 100% | 1.3 MiB/s | 34.6 KiB | 00m00s [ 92/301] perl-Hash-Util-FieldHash-0:1. 100% | 2.2 MiB/s | 38.8 KiB | 00m00s [ 93/301] perl-GDBM_File-1:1.24-520.fc4 100% | 615.4 KiB/s | 42.5 KiB | 00m00s [ 94/301] perl-I18N-Collate-0:1.02-520. 100% | 383.4 KiB/s | 14.2 KiB | 00m00s [ 95/301] perl-I18N-Langinfo-0:0.24-520 100% | 1.6 MiB/s | 25.6 KiB | 00m00s [ 96/301] perl-HTTP-Tiny-0:0.090-521.fc 100% | 426.7 KiB/s | 56.3 KiB | 00m00s [ 97/301] perl-IO-0:1.55-520.fc43.x86_6 100% | 3.3 MiB/s | 82.2 KiB | 00m00s [ 98/301] perl-IO-Socket-IP-0:0.43-521. 100% | 1.7 MiB/s | 42.1 KiB | 00m00s [ 99/301] perl-I18N-LangTags-0:0.45-520 100% | 395.6 KiB/s | 52.6 KiB | 00m00s [100/301] perl-IO-Zlib-1:1.15-520.fc43. 100% | 556.2 KiB/s | 19.5 KiB | 00m00s [101/301] perl-IPC-Open3-0:1.24-520.fc4 100% | 920.4 KiB/s | 23.9 KiB | 00m00s [102/301] perl-IPC-Cmd-2:1.04-521.fc43. 100% | 501.1 KiB/s | 39.6 KiB | 00m00s [103/301] perl-JSON-PP-1:4.16-521.fc43. 100% | 668.0 KiB/s | 65.5 KiB | 00m00s [104/301] perl-IO-Compress-0:2.213-521. 100% | 875.1 KiB/s | 305.4 KiB | 00m00s [105/301] perl-Locale-Maketext-0:1.33-5 100% | 849.8 KiB/s | 93.5 KiB | 00m00s [106/301] perl-IPC-SysV-0:2.09-521.fc43 100% | 156.3 KiB/s | 40.6 KiB | 00m00s [107/301] perl-Locale-Maketext-Simple-1 100% | 325.6 KiB/s | 17.6 KiB | 00m00s [108/301] perl-Math-BigInt-FastCalc-0:0 100% | 575.9 KiB/s | 28.2 KiB | 00m00s [109/301] perl-Math-Complex-0:1.63-520. 100% | 699.5 KiB/s | 46.2 KiB | 00m00s [110/301] perl-Math-BigInt-1:2.0050.03- 100% | 1.1 MiB/s | 234.5 KiB | 00m00s [111/301] perl-Memoize-0:1.17-520.fc43. 100% | 490.1 KiB/s | 46.6 KiB | 00m00s [112/301] perl-Module-CoreList-1:5.2025 100% | 1.0 MiB/s | 93.3 KiB | 00m00s [113/301] perl-Module-CoreList-tools-1: 100% | 377.4 KiB/s | 18.9 KiB | 00m00s [114/301] perl-Module-Load-1:0.36-520.f 100% | 465.3 KiB/s | 17.2 KiB | 00m00s [115/301] perl-Module-Load-Conditional- 100% | 592.3 KiB/s | 21.9 KiB | 00m00s [116/301] perl-Module-Loaded-1:0.08-520 100% | 432.2 KiB/s | 13.4 KiB | 00m00s [117/301] perl-NDBM_File-0:1.18-520.fc4 100% | 481.2 KiB/s | 22.6 KiB | 00m00s [118/301] perl-Module-Metadata-0:1.0000 100% | 627.8 KiB/s | 35.2 KiB | 00m00s [119/301] perl-NEXT-0:0.69-520.fc43.noa 100% | 465.4 KiB/s | 20.9 KiB | 00m00s [120/301] perl-MIME-Base64-0:3.16-520.f 100% | 64.7 KiB/s | 29.7 KiB | 00m00s [121/301] perl-Net-0:1.04-520.fc43.noar 100% | 391.0 KiB/s | 22.7 KiB | 00m00s [122/301] perl-Opcode-0:1.69-520.fc43.x 100% | 852.8 KiB/s | 35.8 KiB | 00m00s [123/301] perl-Net-Ping-0:2.76-520.fc43 100% | 784.7 KiB/s | 49.4 KiB | 00m00s [124/301] perl-ODBM_File-0:1.20-520.fc4 100% | 259.0 KiB/s | 22.8 KiB | 00m00s [125/301] perl-Params-Check-1:0.38-520. 100% | 353.4 KiB/s | 21.6 KiB | 00m00s [126/301] perl-PerlIO-via-QuotedPrint-0 100% | 770.5 KiB/s | 21.6 KiB | 00m00s [127/301] perl-POSIX-0:2.23-520.fc43.x8 100% | 905.7 KiB/s | 97.8 KiB | 00m00s [128/301] perl-Perl-OSType-0:1.010-521. 100% | 261.3 KiB/s | 22.7 KiB | 00m00s [129/301] perl-Pod-Functions-0:1.14-520 100% | 319.0 KiB/s | 14.7 KiB | 00m00s [130/301] perl-Pod-Escapes-1:1.07-520.f 100% | 256.9 KiB/s | 19.8 KiB | 00m00s [131/301] perl-Pod-Html-0:1.35-520.fc43 100% | 447.4 KiB/s | 29.5 KiB | 00m00s [132/301] perl-Pod-Perldoc-0:3.28.01-52 100% | 969.0 KiB/s | 84.3 KiB | 00m00s [133/301] perl-Pod-Usage-4:2.05-520.fc4 100% | 664.5 KiB/s | 40.5 KiB | 00m00s [134/301] perl-Pod-Checker-4:1.77-520.f 100% | 111.1 KiB/s | 31.6 KiB | 00m00s [135/301] perl-Safe-0:2.47-520.fc43.noa 100% | 429.7 KiB/s | 24.9 KiB | 00m00s [136/301] perl-SelectSaver-0:1.02-520.f 100% | 404.2 KiB/s | 11.7 KiB | 00m00s [137/301] perl-Search-Dict-0:1.08-520.f 100% | 185.9 KiB/s | 13.0 KiB | 00m00s [138/301] perl-SelfLoader-0:1.28-520.fc 100% | 465.8 KiB/s | 21.4 KiB | 00m00s [139/301] perl-Socket-4:2.040-2.fc43.x8 100% | 763.1 KiB/s | 54.9 KiB | 00m00s [140/301] perl-Pod-Simple-1:3.47-3.fc43 100% | 631.8 KiB/s | 219.9 KiB | 00m00s [141/301] perl-Storable-1:3.37-521.fc43 100% | 864.5 KiB/s | 98.5 KiB | 00m00s [142/301] perl-Sys-Hostname-0:1.25-520. 100% | 451.5 KiB/s | 17.2 KiB | 00m00s [143/301] perl-Sys-Syslog-0:0.36-521.fc 100% | 545.2 KiB/s | 46.3 KiB | 00m00s [144/301] perl-Term-ANSIColor-0:5.01-52 100% | 609.9 KiB/s | 47.6 KiB | 00m00s [145/301] perl-Term-Cap-0:1.18-520.fc43 100% | 359.6 KiB/s | 21.9 KiB | 00m00s [146/301] perl-Term-Complete-0:1.403-52 100% | 224.1 KiB/s | 13.0 KiB | 00m00s [147/301] perl-Term-ReadLine-0:1.17-520 100% | 173.1 KiB/s | 19.0 KiB | 00m00s [148/301] perl-Term-Table-0:0.024-520.f 100% | 439.7 KiB/s | 43.1 KiB | 00m00s [149/301] perl-Test-0:1.31-520.fc43.noa 100% | 414.0 KiB/s | 28.6 KiB | 00m00s [150/301] perl-Symbol-0:1.09-520.fc43.n 100% | 28.5 KiB/s | 14.2 KiB | 00m00s [151/301] perl-Text-Abbrev-0:1.02-520.f 100% | 357.3 KiB/s | 12.1 KiB | 00m00s [152/301] perl-Text-Balanced-0:2.07-1.f 100% | 566.4 KiB/s | 48.7 KiB | 00m00s [153/301] perl-Text-ParseWords-0:3.31-5 100% | 510.8 KiB/s | 16.3 KiB | 00m00s [154/301] perl-Test-Harness-1:3.52-4.fc 100% | 849.2 KiB/s | 277.7 KiB | 00m00s [155/301] perl-Text-Tabs+Wrap-0:2024.00 100% | 373.0 KiB/s | 21.6 KiB | 00m00s [156/301] perl-Thread-0:3.06-520.fc43.n 100% | 333.5 KiB/s | 18.0 KiB | 00m00s [157/301] perl-Thread-Queue-0:3.14-520. 100% | 709.5 KiB/s | 21.3 KiB | 00m00s [158/301] perl-Thread-Semaphore-0:2.13- 100% | 240.9 KiB/s | 15.7 KiB | 00m00s [159/301] perl-Tie-0:4.6-520.fc43.noarc 100% | 515.3 KiB/s | 27.8 KiB | 00m00s [160/301] perl-Tie-Memoize-0:1.1-520.fc 100% | 523.0 KiB/s | 14.1 KiB | 00m00s [161/301] perl-Tie-RefHash-0:1.41-520.f 100% | 712.4 KiB/s | 23.5 KiB | 00m00s [162/301] perl-Time-0:1.04-520.fc43.noa 100% | 580.2 KiB/s | 16.8 KiB | 00m00s [163/301] perl-Time-HiRes-4:1.9778-520. 100% | 865.0 KiB/s | 57.1 KiB | 00m00s [164/301] perl-Time-Local-2:1.350-520.f 100% | 625.0 KiB/s | 34.4 KiB | 00m00s [165/301] perl-Tie-File-0:1.10-520.fc43 100% | 144.6 KiB/s | 43.2 KiB | 00m00s [166/301] perl-Time-Piece-0:1.3600-520. 100% | 474.9 KiB/s | 40.4 KiB | 00m00s [167/301] perl-Unicode-Normalize-0:1.32 100% | 860.1 KiB/s | 74.0 KiB | 00m00s [168/301] perl-Unicode-UCD-0:0.81-520.f 100% | 967.7 KiB/s | 79.4 KiB | 00m00s [169/301] perl-User-pwent-0:1.05-520.fc 100% | 425.2 KiB/s | 19.6 KiB | 00m00s [170/301] perl-Test-Simple-3:1.302214-4 100% | 878.3 KiB/s | 863.4 KiB | 00m01s [171/301] perl-autodie-0:2.37-521.fc43. 100% | 1.3 MiB/s | 96.7 KiB | 00m00s [172/301] perl-base-0:2.27-520.fc43.noa 100% | 1.0 MiB/s | 16.2 KiB | 00m00s [173/301] perl-autouse-0:1.11-520.fc43. 100% | 394.1 KiB/s | 13.8 KiB | 00m00s [174/301] perl-bignum-0:0.67-521.fc43.n 100% | 978.5 KiB/s | 48.9 KiB | 00m00s [175/301] perl-blib-0:1.07-520.fc43.noa 100% | 269.7 KiB/s | 12.4 KiB | 00m00s [176/301] perl-Unicode-Collate-0:1.31-5 100% | 1.7 MiB/s | 645.1 KiB | 00m00s [177/301] perl-constant-0:1.33-521.fc43 100% | 990.0 KiB/s | 22.8 KiB | 00m00s [178/301] perl-deprecate-0:0.04-520.fc4 100% | 694.1 KiB/s | 14.6 KiB | 00m00s [179/301] perl-diagnostics-0:1.40-520.f 100% | 2.2 MiB/s | 220.5 KiB | 00m00s [180/301] perl-debugger-0:1.60-520.fc43 100% | 877.7 KiB/s | 133.4 KiB | 00m00s [181/301] perl-encoding-4:3.00-520.fc43 100% | 676.0 KiB/s | 62.9 KiB | 00m00s [182/301] perl-encoding-warnings-0:0.14 100% | 517.3 KiB/s | 16.6 KiB | 00m00s [183/301] perl-experimental-0:0.036-2.f 100% | 496.4 KiB/s | 27.3 KiB | 00m00s [184/301] perl-devel-4:5.42.0-520.fc43. 100% | 1.9 MiB/s | 663.7 KiB | 00m00s [185/301] perl-fields-0:2.27-520.fc43.n 100% | 374.8 KiB/s | 16.1 KiB | 00m00s [186/301] perl-filetest-0:1.03-520.fc43 100% | 730.0 KiB/s | 14.6 KiB | 00m00s [187/301] perl-if-0:0.61.000-520.fc43.n 100% | 500.1 KiB/s | 14.0 KiB | 00m00s [188/301] perl-less-0:0.03-520.fc43.noa 100% | 347.2 KiB/s | 13.2 KiB | 00m00s [189/301] perl-lib-0:0.65-520.fc43.x86_ 100% | 439.7 KiB/s | 15.0 KiB | 00m00s [190/301] perl-libnetcfg-4:5.42.0-520.f 100% | 429.6 KiB/s | 16.3 KiB | 00m00s [191/301] perl-libnet-0:3.15-521.fc43.n 100% | 1.9 MiB/s | 128.3 KiB | 00m00s [192/301] perl-locale-0:1.13-520.fc43.n 100% | 150.0 KiB/s | 13.5 KiB | 00m00s [193/301] perl-macros-4:5.42.0-520.fc43 100% | 585.5 KiB/s | 12.3 KiB | 00m00s [194/301] perl-meta-notation-0:5.42.0-5 100% | 761.6 KiB/s | 10.7 KiB | 00m00s [195/301] perl-mro-0:1.29-520.fc43.x86_ 100% | 1.2 MiB/s | 29.9 KiB | 00m00s [196/301] perl-open-0:1.13-520.fc43.noa 100% | 532.8 KiB/s | 16.5 KiB | 00m00s [197/301] perl-overload-0:1.40-520.fc43 100% | 1.1 MiB/s | 45.6 KiB | 00m00s [198/301] perl-overloading-0:0.02-520.f 100% | 645.5 KiB/s | 12.9 KiB | 00m00s [199/301] perl-parent-1:0.244-520.fc43. 100% | 643.7 KiB/s | 14.8 KiB | 00m00s [200/301] perl-perlfaq-0:5.20250619-520 100% | 1.2 MiB/s | 378.6 KiB | 00m00s [201/301] perl-ph-0:5.42.0-520.fc43.x86 100% | 1.1 MiB/s | 51.0 KiB | 00m00s [202/301] perl-podlators-1:6.0.2-520.fc 100% | 1.2 MiB/s | 128.3 KiB | 00m00s [203/301] perl-doc-0:5.42.0-520.fc43.no 100% | 4.5 MiB/s | 5.0 MiB | 00m01s [204/301] perl-sigtrap-0:1.10-520.fc43. 100% | 489.4 KiB/s | 15.7 KiB | 00m00s [205/301] perl-sort-0:2.06-520.fc43.noa 100% | 733.2 KiB/s | 13.2 KiB | 00m00s [206/301] perl-subs-0:1.04-520.fc43.noa 100% | 532.4 KiB/s | 11.7 KiB | 00m00s [207/301] perl-threads-1:2.43-520.fc43. 100% | 2.2 MiB/s | 57.9 KiB | 00m00s [208/301] perl-utils-0:5.42.0-520.fc43. 100% | 1.1 MiB/s | 52.5 KiB | 00m00s [209/301] perl-threads-shared-0:1.70-52 100% | 766.0 KiB/s | 44.4 KiB | 00m00s [210/301] perl-vars-0:1.05-520.fc43.noa 100% | 1.4 MiB/s | 13.0 KiB | 00m00s [211/301] perl-vmsish-0:1.04-520.fc43.n 100% | 1.4 MiB/s | 14.1 KiB | 00m00s [212/301] perl-MIME-Base32-0:1.303-24.f 100% | 1.4 MiB/s | 20.4 KiB | 00m00s [213/301] perl-version-9:0.99.33-521.fc 100% | 1.6 MiB/s | 62.8 KiB | 00m00s [214/301] less-0:679-2.fc43.x86_64 100% | 3.7 MiB/s | 195.3 KiB | 00m00s [215/301] perl-IO-Compress-Lzma-0:2.213 100% | 1.8 MiB/s | 76.6 KiB | 00m00s [216/301] perl-Text-Diff-0:1.45-24.fc43 100% | 1.7 MiB/s | 39.9 KiB | 00m00s [217/301] perl-libs-4:5.42.0-520.fc43.x 100% | 2.5 MiB/s | 2.6 MiB | 00m01s [218/301] perl-Archive-Zip-0:1.68-17.fc 100% | 3.7 MiB/s | 111.2 KiB | 00m00s [219/301] perl-Compress-Bzip2-0:2.28-24 100% | 2.0 MiB/s | 66.6 KiB | 00m00s [220/301] perl-Devel-Size-0:0.85-3.fc43 100% | 1.8 MiB/s | 30.6 KiB | 00m00s [221/301] perl-File-HomeDir-0:1.006-15. 100% | 2.9 MiB/s | 59.2 KiB | 00m00s [222/301] man-db-0:2.13.1-2.fc43.x86_64 100% | 7.7 MiB/s | 1.4 MiB | 00m00s [223/301] perl-Text-Glob-0:0.11-26.fc43 100% | 694.4 KiB/s | 13.2 KiB | 00m00s [224/301] perl-Module-Signature-0:0.93- 100% | 2.4 MiB/s | 87.6 KiB | 00m00s [225/301] perl-Module-Build-2:0.42.34-9 100% | 4.4 MiB/s | 251.1 KiB | 00m00s [226/301] perl-local-lib-0:2.000029-10. 100% | 3.1 MiB/s | 66.1 KiB | 00m00s [227/301] libdb-0:5.3.28-66.fc43.x86_64 100% | 14.9 MiB/s | 778.2 KiB | 00m00s [228/301] perl-Net-SSLeay-0:1.94-11.fc4 100% | 5.2 MiB/s | 374.8 KiB | 00m00s [229/301] perl-IO-Socket-SSL-0:2.095-2. 100% | 3.0 MiB/s | 231.5 KiB | 00m00s [230/301] groff-base-0:1.23.0-9.fc43.x8 100% | 17.4 MiB/s | 1.1 MiB | 00m00s [231/301] ncurses-0:6.5-7.20250614.fc43 100% | 9.7 MiB/s | 426.2 KiB | 00m00s [232/301] perl-IPC-System-Simple-0:1.30 100% | 410.7 KiB/s | 38.6 KiB | 00m00s [233/301] libxcrypt-devel-0:4.4.38-8.fc 100% | 485.8 KiB/s | 29.2 KiB | 00m00s [234/301] systemtap-sdt-dtrace-0:5.3-3. 100% | 1.0 MiB/s | 69.6 KiB | 00m00s [235/301] libpipeline-0:1.5.8-3.fc43.x8 100% | 2.0 MiB/s | 59.9 KiB | 00m00s [236/301] perl-Compress-Raw-Lzma-0:2.21 100% | 1.3 MiB/s | 51.2 KiB | 00m00s [237/301] perl-Algorithm-Diff-0:1.2010- 100% | 986.1 KiB/s | 46.3 KiB | 00m00s [238/301] perl-inc-latest-2:0.500-31.fc 100% | 625.5 KiB/s | 23.1 KiB | 00m00s [239/301] perl-Software-License-0:0.104 100% | 2.9 MiB/s | 150.0 KiB | 00m00s [240/301] python3-pyparsing-0:3.1.2-12. 100% | 3.5 MiB/s | 286.7 KiB | 00m00s [241/301] glibc-devel-0:2.42-2.fc43.x86 100% | 5.2 MiB/s | 564.7 KiB | 00m00s [242/301] perl-Text-Template-0:1.61-8.f 100% | 1.3 MiB/s | 59.0 KiB | 00m00s [243/301] perl-MRO-Compat-0:0.15-12.fc4 100% | 664.1 KiB/s | 25.2 KiB | 00m00s [244/301] perl-Sub-Exporter-0:0.991-6.f 100% | 3.1 MiB/s | 77.3 KiB | 00m00s [245/301] perl-Data-OptList-0:0.114-7.f 100% | 1.0 MiB/s | 26.5 KiB | 00m00s [246/301] perl-Package-Generator-0:1.10 100% | 854.1 KiB/s | 22.2 KiB | 00m00s [247/301] perl-Params-Util-0:1.102-19.f 100% | 1.1 MiB/s | 32.6 KiB | 00m00s [248/301] python3-0:3.14.0~rc1-2.fc43.x 100% | 1.1 MiB/s | 27.4 KiB | 00m00s [249/301] perl-Sub-Install-0:0.929-8.fc 100% | 564.7 KiB/s | 22.6 KiB | 00m00s [250/301] perl-Data-Section-0:0.200008- 100% | 81.8 KiB/s | 24.9 KiB | 00m00s [251/301] mpdecimal-0:4.0.1-2.fc43.x86_ 100% | 1.0 MiB/s | 97.1 KiB | 00m00s [252/301] tzdata-0:2025b-3.fc43.noarch 100% | 2.6 MiB/s | 713.9 KiB | 00m00s [253/301] python-pip-wheel-0:25.1.1-14. 100% | 4.3 MiB/s | 1.2 MiB | 00m00s [254/301] systemtap-sdt-devel-0:5.3-3.f 100% | 1.2 MiB/s | 68.8 KiB | 00m00s [255/301] perl-Encode-devel-4:3.21-520. 100% | 931.1 KiB/s | 41.0 KiB | 00m00s [256/301] perl-Encode-4:3.21-520.fc43.x 100% | 3.1 MiB/s | 1.1 MiB | 00m00s [257/301] procps-ng-0:4.0.4-7.fc43.x86_ 100% | 3.1 MiB/s | 364.5 KiB | 00m00s [258/301] kernel-headers-0:6.16.0-65.fc 100% | 4.6 MiB/s | 1.7 MiB | 00m00s [259/301] libtommath-0:1.3.1~rc1-6.fc43 100% | 813.9 KiB/s | 64.3 KiB | 00m00s [260/301] rocm-smi-0:6.4.3-1.fc43.x86_6 100% | 15.2 MiB/s | 606.1 KiB | 00m00s [261/301] libdrm-devel-0:2.4.125-2.fc43 100% | 3.7 MiB/s | 183.4 KiB | 00m00s [262/301] libdrm-0:2.4.125-2.fc43.x86_6 100% | 3.6 MiB/s | 161.3 KiB | 00m00s [263/301] libpciaccess-0:0.16-16.fc43.x 100% | 1.7 MiB/s | 26.2 KiB | 00m00s [264/301] python3-libs-0:3.14.0~rc1-2.f 100% | 8.3 MiB/s | 9.8 MiB | 00m01s [265/301] tcl-1:9.0.2-1.fc43.x86_64 100% | 3.3 MiB/s | 1.2 MiB | 00m00s [266/301] rocm-runtime-0:6.4.2-2.fc43.x 100% | 14.1 MiB/s | 650.9 KiB | 00m00s [267/301] numactl-libs-0:2.0.19-3.fc43. 100% | 971.0 KiB/s | 31.1 KiB | 00m00s [268/301] hwdata-0:0.398-1.fc43.noarch 100% | 4.2 MiB/s | 1.7 MiB | 00m00s [269/301] rocm-hip-0:6.4.2-2.fc43.x86_6 100% | 28.3 MiB/s | 9.5 MiB | 00m00s [270/301] hipcc-0:19-15.rocm6.4.2.fc43. 100% | 418.5 KiB/s | 134.3 KiB | 00m00s [271/301] rocm-device-libs-0:19-15.rocm 100% | 4.0 MiB/s | 490.5 KiB | 00m00s [272/301] rocm-clang-devel-0:19-15.rocm 100% | 13.4 MiB/s | 2.4 MiB | 00m00s [273/301] rocm-clang-runtime-devel-0:19 100% | 1.0 MiB/s | 594.8 KiB | 00m01s [274/301] rocm-clang-0:19-15.rocm6.4.2. 100% | 18.4 MiB/s | 16.0 MiB | 00m01s [275/301] git-0:2.50.1-2.fc43.x86_64 100% | 772.6 KiB/s | 51.0 KiB | 00m00s [276/301] rocm-libc++-devel-0:19-15.roc 100% | 1.4 MiB/s | 904.6 KiB | 00m01s [277/301] rocm-clang-libs-0:19-15.rocm6 100% | 16.4 MiB/s | 22.8 MiB | 00m01s [278/301] rocm-libc++-0:19-15.rocm6.4.2 100% | 612.1 KiB/s | 346.5 KiB | 00m01s [279/301] rocm-llvm-libs-0:19-15.rocm6. 100% | 18.0 MiB/s | 20.2 MiB | 00m01s [280/301] perl-Git-0:2.50.1-2.fc43.noar 100% | 900.2 KiB/s | 37.8 KiB | 00m00s [281/301] git-core-doc-0:2.50.1-2.fc43. 100% | 9.9 MiB/s | 3.1 MiB | 00m00s [282/301] rocm-llvm-filesystem-0:19-15. 100% | 2.5 MiB/s | 25.3 KiB | 00m00s [283/301] git-core-0:2.50.1-2.fc43.x86_ 100% | 5.7 MiB/s | 5.0 MiB | 00m01s [284/301] openssh-clients-0:10.0p1-5.fc 100% | 15.5 MiB/s | 746.7 KiB | 00m00s [285/301] perl-Error-1:0.17030-2.fc43.n 100% | 1.6 MiB/s | 40.2 KiB | 00m00s [286/301] libedit-0:3.1-56.20250104cvs. 100% | 4.5 MiB/s | 105.2 KiB | 00m00s [287/301] libfido2-0:1.16.0-3.fc43.x86_ 100% | 2.7 MiB/s | 98.5 KiB | 00m00s [288/301] openssh-0:10.0p1-5.fc43.x86_6 100% | 7.7 MiB/s | 339.6 KiB | 00m00s [289/301] perl-TermReadKey-0:2.38-26.fc 100% | 110.5 KiB/s | 35.2 KiB | 00m00s [290/301] libcbor-0:0.12.0-6.fc43.x86_6 100% | 644.4 KiB/s | 33.5 KiB | 00m00s [291/301] rocm-lld-0:19-15.rocm6.4.2.fc 100% | 18.7 MiB/s | 1.5 MiB | 00m00s [292/301] rocm-llvm-devel-0:19-15.rocm6 100% | 5.5 MiB/s | 3.8 MiB | 00m01s [293/301] rocm-core-0:6.4.3-1.fc43.x86_ 100% | 49.1 KiB/s | 13.8 KiB | 00m00s [294/301] libpciaccess-devel-0:0.16-16. 100% | 429.1 KiB/s | 12.4 KiB | 00m00s [295/301] zlib-ng-compat-devel-0:2.2.5- 100% | 958.4 KiB/s | 38.3 KiB | 00m00s [296/301] rocm-llvm-0:19-15.rocm6.4.2.f 100% | 11.2 MiB/s | 13.1 MiB | 00m01s [297/301] gcc-plugin-annobin-0:15.2.1-1 100% | 408.0 KiB/s | 56.3 KiB | 00m00s [298/301] annobin-docs-0:12.99-1.fc43.n 100% | 2.6 MiB/s | 89.5 KiB | 00m00s [299/301] cmake-rpm-macros-0:3.31.6-4.f 100% | 510.6 KiB/s | 14.8 KiB | 00m00s [300/301] annobin-plugin-gcc-0:12.99-1. 100% | 5.5 MiB/s | 996.0 KiB | 00m00s [301/301] rocm-llvm-static-0:19-15.rocm 100% | 35.5 MiB/s | 260.6 MiB | 00m07s -------------------------------------------------------------------------------- [301/301] Total 100% | 22.7 MiB/s | 529.4 MiB | 00m23s Running transaction [ 1/303] Verify package files 100% | 132.0 B/s | 301.0 B | 00m02s [ 2/303] Prepare transaction 100% | 1.2 KiB/s | 301.0 B | 00m00s [ 3/303] Installing cmake-filesystem-0 100% | 1.9 MiB/s | 7.6 KiB | 00m00s [ 4/303] Installing less-0:679-2.fc43. 100% | 23.5 MiB/s | 409.4 KiB | 00m00s [ 5/303] Installing libmpc-0:1.3.1-8.f 100% | 79.1 MiB/s | 162.1 KiB | 00m00s [ 6/303] Installing make-1:4.4.1-11.fc 100% | 78.3 MiB/s | 1.8 MiB | 00m00s [ 7/303] Installing expat-0:2.7.1-3.fc 100% | 18.1 MiB/s | 296.3 KiB | 00m00s [ 8/303] Installing rocm-llvm-filesyst 100% | 2.3 MiB/s | 19.1 KiB | 00m00s [ 9/303] Installing rocm-libc++-0:19-1 100% | 31.6 MiB/s | 1.2 MiB | 00m00s [ 10/303] Installing rocm-llvm-libs-0:1 100% | 65.5 MiB/s | 84.8 MiB | 00m01s [ 11/303] Installing rocm-clang-libs-0: 100% | 67.4 MiB/s | 98.4 MiB | 00m01s [ 12/303] Installing numactl-libs-0:2.0 100% | 14.1 MiB/s | 57.8 KiB | 00m00s [ 13/303] Installing kernel-headers-0:6 100% | 103.4 MiB/s | 6.8 MiB | 00m00s [ 14/303] Installing libxcrypt-devel-0: 100% | 10.8 MiB/s | 33.1 KiB | 00m00s [ 15/303] Installing glibc-devel-0:2.42 100% | 56.0 MiB/s | 2.4 MiB | 00m00s [ 16/303] Installing groff-base-0:1.23. 100% | 75.4 MiB/s | 3.8 MiB | 00m00s [ 17/303] Installing vim-filesystem-2:9 100% | 2.3 MiB/s | 4.7 KiB | 00m00s [ 18/303] Installing emacs-filesystem-1 100% | 531.2 KiB/s | 544.0 B | 00m00s [ 19/303] Installing rocm-comgr-0:19-15 100% | 64.5 MiB/s | 123.9 MiB | 00m02s [ 20/303] Installing rocm-lld-0:19-15.r 100% | 61.0 MiB/s | 5.7 MiB | 00m00s [ 21/303] Installing rocm-libc++-devel- 100% | 61.2 MiB/s | 7.7 MiB | 00m00s [ 22/303] Installing cpp-0:15.2.1-1.fc4 100% | 271.1 MiB/s | 38.0 MiB | 00m00s [ 23/303] Installing gcc-0:15.2.1-1.fc4 100% | 297.6 MiB/s | 111.9 MiB | 00m00s [ 24/303] Installing zlib-ng-compat-dev 100% | 53.0 MiB/s | 108.5 KiB | 00m00s [ 25/303] Installing annobin-docs-0:12. 100% | 97.7 MiB/s | 100.1 KiB | 00m00s [ 26/303] Installing rocm-core-0:6.4.3- 100% | 13.2 MiB/s | 13.5 KiB | 00m00s [ 27/303] Installing libcbor-0:0.12.0-6 100% | 77.3 MiB/s | 79.2 KiB | 00m00s [ 28/303] Installing libfido2-0:1.16.0- 100% | 117.2 MiB/s | 240.0 KiB | 00m00s [ 29/303] Installing openssh-0:10.0p1-5 100% | 69.6 MiB/s | 1.4 MiB | 00m00s [ 30/303] Installing libedit-0:3.1-56.2 100% | 118.1 MiB/s | 241.8 KiB | 00m00s [ 31/303] Installing openssh-clients-0: 100% | 74.5 MiB/s | 2.6 MiB | 00m00s [ 32/303] Installing git-core-0:2.50.1- 100% | 253.1 MiB/s | 23.5 MiB | 00m00s [ 33/303] Installing git-core-doc-0:2.5 100% | 188.8 MiB/s | 17.9 MiB | 00m00s [ 34/303] Installing rocm-clang-runtime 100% | 98.2 MiB/s | 7.9 MiB | 00m00s [ 35/303] Installing hwdata-0:0.398-1.f 100% | 399.8 MiB/s | 9.6 MiB | 00m00s [ 36/303] Installing libpciaccess-0:0.1 100% | 44.8 MiB/s | 45.9 KiB | 00m00s [ 37/303] Installing libdrm-0:2.4.125-2 100% | 130.1 MiB/s | 399.7 KiB | 00m00s [ 38/303] Installing rocm-runtime-0:6.4 100% | 341.7 MiB/s | 3.1 MiB | 00m00s [ 39/303] Installing rocm-runtime-devel 100% | 187.2 MiB/s | 574.9 KiB | 00m00s [ 40/303] Installing libpciaccess-devel 100% | 15.5 MiB/s | 15.9 KiB | 00m00s [ 41/303] Installing libdrm-devel-0:2.4 100% | 144.1 MiB/s | 737.9 KiB | 00m00s [ 42/303] Installing libtommath-0:1.3.1 100% | 62.3 MiB/s | 127.5 KiB | 00m00s [ 43/303] Installing tcl-1:9.0.2-1.fc43 100% | 117.2 MiB/s | 4.3 MiB | 00m00s [ 44/303] Installing procps-ng-0:4.0.4- 100% | 48.1 MiB/s | 1.0 MiB | 00m00s [ 45/303] Installing systemtap-sdt-deve 100% | 36.0 MiB/s | 184.3 KiB | 00m00s [ 46/303] Installing tzdata-0:2025b-3.f 100% | 24.6 MiB/s | 1.9 MiB | 00m00s [ 47/303] Installing python-pip-wheel-0 100% | 415.0 MiB/s | 1.2 MiB | 00m00s [ 48/303] Installing mpdecimal-0:4.0.1- 100% | 26.7 MiB/s | 218.8 KiB | 00m00s [ 49/303] Installing python3-libs-0:3.1 100% | 195.0 MiB/s | 43.3 MiB | 00m00s [ 50/303] Installing python3-0:3.14.0~r 100% | 2.0 MiB/s | 30.7 KiB | 00m00s [ 51/303] Installing cmake-rpm-macros-0 100% | 8.1 MiB/s | 8.3 KiB | 00m00s [ 52/303] Installing python3-pyparsing- 100% | 171.6 MiB/s | 1.0 MiB | 00m00s [ 53/303] Installing systemtap-sdt-dtra 100% | 11.8 MiB/s | 180.9 KiB | 00m00s [ 54/303] Installing rocm-smi-0:6.4.3-1 100% | 120.7 MiB/s | 2.7 MiB | 00m00s [ 55/303] Installing rocm-llvm-0:19-15. 100% | 61.2 MiB/s | 48.5 MiB | 00m01s [ 56/303] Installing rocm-llvm-devel-0: 100% | 67.8 MiB/s | 25.7 MiB | 00m00s [ 57/303] Installing rocm-llvm-static-0 100% | 82.5 MiB/s | 1.8 GiB | 00m22s [ 58/303] Installing libpipeline-0:1.5. 100% | 5.7 MiB/s | 146.6 KiB | 00m00s [ 59/303] Installing man-db-0:2.13.1-2. 100% | 44.8 MiB/s | 2.9 MiB | 00m00s [ 60/303] Installing environment-module 100% | 41.0 MiB/s | 1.9 MiB | 00m00s [ 61/303] Installing ncurses-0:6.5-7.20 100% | 31.7 MiB/s | 616.4 KiB | 00m00s [ 62/303] Installing perl-Digest-0:1.20 100% | 18.1 MiB/s | 37.1 KiB | 00m00s [ 63/303] Installing perl-FileHandle-0: 100% | 9.6 MiB/s | 9.8 KiB | 00m00s [ 64/303] Installing perl-B-0:1.89-520. 100% | 164.3 MiB/s | 504.7 KiB | 00m00s [ 65/303] Installing perl-Digest-MD5-0: 100% | 30.1 MiB/s | 61.6 KiB | 00m00s [ 66/303] Installing perl-MIME-Base32-0 100% | 31.4 MiB/s | 32.2 KiB | 00m00s [ 67/303] Installing perl-libnet-0:3.15 100% | 95.9 MiB/s | 294.7 KiB | 00m00s [ 68/303] Installing perl-Data-Dumper-0 100% | 57.4 MiB/s | 117.5 KiB | 00m00s [ 69/303] Installing perl-URI-0:5.32-2. 100% | 44.6 MiB/s | 274.1 KiB | 00m00s [ 70/303] Installing perl-IO-Socket-IP- 100% | 99.8 MiB/s | 102.2 KiB | 00m00s [ 71/303] Installing perl-AutoLoader-0: 100% | 20.5 MiB/s | 21.0 KiB | 00m00s [ 72/303] Installing perl-Net-SSLeay-0: 100% | 135.9 MiB/s | 1.4 MiB | 00m00s [ 73/303] Installing perl-IO-Socket-SSL 100% | 175.4 MiB/s | 718.6 KiB | 00m00s [ 74/303] Installing perl-Pod-Escapes-1 100% | 25.3 MiB/s | 25.9 KiB | 00m00s [ 75/303] Installing perl-File-Path-0:2 100% | 63.0 MiB/s | 64.5 KiB | 00m00s [ 76/303] Installing perl-Time-Local-2: 100% | 68.9 MiB/s | 70.6 KiB | 00m00s [ 77/303] Installing perl-locale-0:1.13 100% | 0.0 B/s | 6.5 KiB | 00m00s [ 78/303] Installing perl-if-0:0.61.000 100% | 0.0 B/s | 6.2 KiB | 00m00s [ 79/303] Installing perl-Text-Tabs+Wra 100% | 23.3 MiB/s | 23.9 KiB | 00m00s [ 80/303] Installing perl-Pod-Simple-1: 100% | 112.3 MiB/s | 574.9 KiB | 00m00s [ 81/303] Installing perl-HTTP-Tiny-0:0 100% | 76.4 MiB/s | 156.4 KiB | 00m00s [ 82/303] Installing perl-Term-Cap-0:1. 100% | 29.9 MiB/s | 30.6 KiB | 00m00s [ 83/303] Installing perl-File-Temp-1:0 100% | 160.2 MiB/s | 164.1 KiB | 00m00s [ 84/303] Installing perl-IPC-Open3-0:1 100% | 27.8 MiB/s | 28.5 KiB | 00m00s [ 85/303] Installing perl-POSIX-0:2.23- 100% | 113.6 MiB/s | 232.6 KiB | 00m00s [ 86/303] Installing perl-Term-ANSIColo 100% | 96.9 MiB/s | 99.2 KiB | 00m00s [ 87/303] Installing perl-Class-Struct- 100% | 25.3 MiB/s | 25.9 KiB | 00m00s [ 88/303] Installing perl-podlators-1:6 100% | 18.5 MiB/s | 321.4 KiB | 00m00s [ 89/303] Installing perl-Pod-Perldoc-0 100% | 10.3 MiB/s | 169.2 KiB | 00m00s [ 90/303] Installing perl-File-stat-0:1 100% | 12.8 MiB/s | 13.1 KiB | 00m00s [ 91/303] Installing perl-Symbol-0:1.09 100% | 7.1 MiB/s | 7.3 KiB | 00m00s [ 92/303] Installing perl-SelectSaver-0 100% | 2.6 MiB/s | 2.6 KiB | 00m00s [ 93/303] Installing perl-Socket-4:2.04 100% | 59.7 MiB/s | 122.3 KiB | 00m00s [ 94/303] Installing perl-Pod-Usage-4:2 100% | 5.7 MiB/s | 87.9 KiB | 00m00s [ 95/303] Installing perl-overloading-0 100% | 5.4 MiB/s | 5.6 KiB | 00m00s [ 96/303] Installing perl-IO-0:1.55-520 100% | 74.0 MiB/s | 151.7 KiB | 00m00s [ 97/303] Installing perl-mro-0:1.29-52 100% | 41.7 MiB/s | 42.7 KiB | 00m00s [ 98/303] Installing perl-base-0:2.27-5 100% | 0.0 B/s | 13.0 KiB | 00m00s [ 99/303] Installing perl-Text-ParseWor 100% | 14.2 MiB/s | 14.6 KiB | 00m00s [100/303] Installing perl-Fcntl-0:1.20- 100% | 48.7 MiB/s | 49.9 KiB | 00m00s [101/303] Installing perl-Getopt-Long-1 100% | 71.9 MiB/s | 147.2 KiB | 00m00s [102/303] Installing perl-vars-0:1.05-5 100% | 0.0 B/s | 4.3 KiB | 00m00s [103/303] Installing perl-parent-1:0.24 100% | 10.7 MiB/s | 11.0 KiB | 00m00s [104/303] Installing perl-overload-0:1. 100% | 70.3 MiB/s | 72.0 KiB | 00m00s [105/303] Installing perl-Storable-1:3. 100% | 113.7 MiB/s | 232.8 KiB | 00m00s [106/303] Installing perl-constant-0:1. 100% | 26.7 MiB/s | 27.4 KiB | 00m00s [107/303] Installing perl-MIME-Base64-0 100% | 21.6 MiB/s | 44.3 KiB | 00m00s [108/303] Installing perl-Errno-0:1.38- 100% | 8.6 MiB/s | 8.8 KiB | 00m00s [109/303] Installing perl-File-Basename 100% | 14.3 MiB/s | 14.6 KiB | 00m00s [110/303] Installing perl-Scalar-List-U 100% | 48.4 MiB/s | 148.7 KiB | 00m00s [111/303] Installing perl-Getopt-Std-0: 100% | 11.5 MiB/s | 11.8 KiB | 00m00s [112/303] Installing perl-Encode-4:3.21 100% | 142.2 MiB/s | 4.7 MiB | 00m00s [113/303] Installing perl-DynaLoader-0: 100% | 31.7 MiB/s | 32.5 KiB | 00m00s [114/303] Installing perl-PathTools-0:3 100% | 60.1 MiB/s | 184.6 KiB | 00m00s [115/303] Installing perl-Exporter-0:5. 100% | 54.3 MiB/s | 55.6 KiB | 00m00s [116/303] Installing perl-Carp-0:1.54-5 100% | 15.5 MiB/s | 47.7 KiB | 00m00s [117/303] Installing perl-libs-4:5.42.0 100% | 143.8 MiB/s | 11.6 MiB | 00m00s [118/303] Installing perl-interpreter-4 100% | 7.3 MiB/s | 120.3 KiB | 00m00s [119/303] Installing perl-File-Find-0:1 100% | 41.5 MiB/s | 42.5 KiB | 00m00s [120/303] Installing perl-version-9:0.9 100% | 64.2 MiB/s | 131.5 KiB | 00m00s [121/303] Installing perl-File-Copy-0:2 100% | 0.0 B/s | 20.2 KiB | 00m00s [122/303] Installing perl-ExtUtils-Mani 100% | 42.2 MiB/s | 86.3 KiB | 00m00s [123/303] Installing perl-lib-0:0.65-52 100% | 0.0 B/s | 8.9 KiB | 00m00s [124/303] Installing perl-threads-1:2.4 100% | 57.2 MiB/s | 117.1 KiB | 00m00s [125/303] Installing perl-threads-share 100% | 41.9 MiB/s | 85.9 KiB | 00m00s [126/303] Installing perl-Compress-Raw- 100% | 80.8 MiB/s | 165.5 KiB | 00m00s [127/303] Installing perl-File-Compare- 100% | 6.0 MiB/s | 6.2 KiB | 00m00s [128/303] Installing perl-Time-HiRes-4: 100% | 57.5 MiB/s | 117.8 KiB | 00m00s [129/303] Installing perl-CPAN-Meta-Req 100% | 40.7 MiB/s | 83.4 KiB | 00m00s [130/303] Installing perl-Module-CoreLi 100% | 412.1 MiB/s | 1.2 MiB | 00m00s [131/303] Installing perl-Module-Metada 100% | 67.4 MiB/s | 69.0 KiB | 00m00s [132/303] Installing perl-Digest-SHA-1: 100% | 7.0 MiB/s | 115.0 KiB | 00m00s [133/303] Installing perl-Filter-2:1.64 100% | 32.5 MiB/s | 166.2 KiB | 00m00s [134/303] Installing perl-Module-Load-1 100% | 15.5 MiB/s | 15.9 KiB | 00m00s [135/303] Installing perl-Perl-OSType-0 100% | 33.5 MiB/s | 34.3 KiB | 00m00s [136/303] Installing perl-Term-ReadLine 100% | 17.4 MiB/s | 17.9 KiB | 00m00s [137/303] Installing perl-Tie-0:4.6-520 100% | 33.1 MiB/s | 33.9 KiB | 00m00s [138/303] Installing perl-Unicode-Norma 100% | 159.0 MiB/s | 488.4 KiB | 00m00s [139/303] Installing perl-meta-notation 100% | 0.0 B/s | 2.3 KiB | 00m00s [140/303] Installing perl-encoding-4:3. 100% | 146.9 MiB/s | 150.4 KiB | 00m00s [141/303] Installing perl-Net-Ping-0:2. 100% | 132.2 MiB/s | 135.3 KiB | 00m00s [142/303] Installing perl-ExtUtils-Comm 100% | 0.0 B/s | 10.2 KiB | 00m00s [143/303] Installing perl-Pod-Html-0:1. 100% | 2.9 MiB/s | 43.9 KiB | 00m00s [144/303] Installing perl-File-Which-0: 100% | 15.3 MiB/s | 31.4 KiB | 00m00s [145/303] Installing perl-AutoSplit-0:5 100% | 23.0 MiB/s | 23.6 KiB | 00m00s [146/303] Installing perl-Benchmark-0:1 100% | 36.0 MiB/s | 36.8 KiB | 00m00s [147/303] Installing perl-Test-Harness- 100% | 22.8 MiB/s | 583.6 KiB | 00m00s [148/303] Installing perl-CPAN-Meta-YAM 100% | 52.3 MiB/s | 53.6 KiB | 00m00s [149/303] Installing perl-Compress-Raw- 100% | 34.0 MiB/s | 69.6 KiB | 00m00s [150/303] Installing perl-IO-Compress-0 100% | 49.1 MiB/s | 1.0 MiB | 00m00s [151/303] Installing perl-IO-Zlib-1:1.1 100% | 26.1 MiB/s | 26.7 KiB | 00m00s [152/303] Installing perl-Devel-PPPort- 100% | 217.8 MiB/s | 892.1 KiB | 00m00s [153/303] Installing perl-DirHandle-0:1 100% | 424.9 KiB/s | 3.8 KiB | 00m00s [154/303] Installing perl-Dumpvalue-0:2 100% | 19.7 MiB/s | 20.2 KiB | 00m00s [155/303] Installing perl-ExtUtils-Cons 100% | 85.7 MiB/s | 87.7 KiB | 00m00s [156/303] Installing perl-ExtUtils-MM-U 100% | 3.6 MiB/s | 3.7 KiB | 00m00s [157/303] Installing perl-Hash-Util-Fie 100% | 62.8 MiB/s | 64.3 KiB | 00m00s [158/303] Installing perl-Hash-Util-0:0 100% | 55.1 MiB/s | 56.4 KiB | 00m00s [159/303] Installing perl-fields-0:2.27 100% | 0.0 B/s | 12.3 KiB | 00m00s [160/303] Installing perl-ExtUtils-Pars 100% | 28.0 MiB/s | 545.5 KiB | 00m00s [161/303] Installing perl-ExtUtils-Make 100% | 38.6 MiB/s | 750.3 KiB | 00m00s [162/303] Installing perl-ExtUtils-Inst 100% | 42.6 MiB/s | 87.2 KiB | 00m00s [163/303] Installing perl-devel-4:5.42. 100% | 136.0 MiB/s | 3.8 MiB | 00m00s [164/303] Installing perl-ExtUtils-Embe 100% | 15.7 MiB/s | 16.1 KiB | 00m00s [165/303] Installing perl-I18N-LangTags 100% | 81.8 MiB/s | 83.8 KiB | 00m00s [166/303] Installing perl-Locale-Makete 100% | 84.9 MiB/s | 173.9 KiB | 00m00s [167/303] Installing perl-Locale-Makete 100% | 13.2 MiB/s | 13.5 KiB | 00m00s [168/303] Installing perl-Params-Check- 100% | 27.9 MiB/s | 28.6 KiB | 00m00s [169/303] Installing perl-Module-Load-C 100% | 29.2 MiB/s | 29.9 KiB | 00m00s [170/303] Installing perl-IPC-Cmd-2:1.0 100% | 16.8 MiB/s | 85.9 KiB | 00m00s [171/303] Installing perl-Math-Complex- 100% | 84.0 MiB/s | 86.0 KiB | 00m00s [172/303] Installing perl-Math-BigInt-1 100% | 212.8 MiB/s | 1.1 MiB | 00m00s [173/303] Installing perl-JSON-PP-1:4.1 100% | 9.3 MiB/s | 143.6 KiB | 00m00s [174/303] Installing perl-CPAN-Meta-0:2 100% | 66.6 MiB/s | 613.8 KiB | 00m00s [175/303] Installing perl-NDBM_File-0:1 100% | 28.9 MiB/s | 29.6 KiB | 00m00s [176/303] Installing perl-SelfLoader-0: 100% | 0.0 B/s | 22.6 KiB | 00m00s [177/303] Installing perl-Sys-Hostname- 100% | 16.8 MiB/s | 17.2 KiB | 00m00s [178/303] Installing perl-Term-Table-0: 100% | 39.6 MiB/s | 81.1 KiB | 00m00s [179/303] Installing perl-Text-Balanced 100% | 110.2 MiB/s | 112.8 KiB | 00m00s [180/303] Installing perl-Tie-RefHash-0 100% | 36.5 MiB/s | 37.4 KiB | 00m00s [181/303] Installing perl-User-pwent-0: 100% | 17.5 MiB/s | 17.9 KiB | 00m00s [182/303] Installing perl-autouse-0:1.1 100% | 0.0 B/s | 6.4 KiB | 00m00s [183/303] Installing perl-subs-0:1.04-5 100% | 0.0 B/s | 2.5 KiB | 00m00s [184/303] Installing perl-Opcode-0:1.69 100% | 48.8 MiB/s | 50.0 KiB | 00m00s [185/303] Installing perl-Safe-0:2.47-5 100% | 30.3 MiB/s | 31.1 KiB | 00m00s [186/303] Installing perl-Params-Util-0 100% | 29.8 MiB/s | 61.0 KiB | 00m00s [187/303] Installing perl-Sub-Install-0 100% | 36.3 MiB/s | 37.2 KiB | 00m00s [188/303] Installing perl-Data-OptList- 100% | 51.0 MiB/s | 52.2 KiB | 00m00s [189/303] Installing perl-Filter-Simple 100% | 25.3 MiB/s | 51.7 KiB | 00m00s [190/303] Installing perl-Test-Simple-3 100% | 68.1 MiB/s | 1.8 MiB | 00m00s [191/303] Installing perl-Devel-SelfStu 100% | 0.0 B/s | 7.3 KiB | 00m00s [192/303] Installing perl-Memoize-0:1.1 100% | 65.2 MiB/s | 66.8 KiB | 00m00s [193/303] Installing perl-Math-BigInt-F 100% | 22.9 MiB/s | 46.9 KiB | 00m00s [194/303] Installing perl-bignum-0:0.67 100% | 66.6 MiB/s | 136.5 KiB | 00m00s [195/303] Installing perl-File-Fetch-0: 100% | 59.9 MiB/s | 61.3 KiB | 00m00s [196/303] Installing perl-ExtUtils-Mini 100% | 0.0 B/s | 8.8 KiB | 00m00s [197/303] Installing perl-inc-latest-2: 100% | 35.5 MiB/s | 36.3 KiB | 00m00s [198/303] Installing perl-libnetcfg-4:5 100% | 1.2 MiB/s | 17.3 KiB | 00m00s [199/303] Installing perl-DBM_Filter-0: 100% | 30.0 MiB/s | 30.7 KiB | 00m00s [200/303] Installing perl-File-HomeDir- 100% | 40.3 MiB/s | 123.8 KiB | 00m00s [201/303] Installing perl-open-0:1.13-5 100% | 0.0 B/s | 11.7 KiB | 00m00s [202/303] Installing perl-debugger-0:1. 100% | 197.4 MiB/s | 404.3 KiB | 00m00s [203/303] Installing perl-sigtrap-0:1.1 100% | 11.2 MiB/s | 11.5 KiB | 00m00s [204/303] Installing perl-Unicode-Colla 100% | 233.1 MiB/s | 4.2 MiB | 00m00s [205/303] Installing perl-Unicode-UCD-0 100% | 202.1 MiB/s | 206.9 KiB | 00m00s [206/303] Installing perl-Env-0:1.06-52 100% | 26.6 MiB/s | 27.2 KiB | 00m00s [207/303] Installing perl-Module-CoreLi 100% | 1.3 MiB/s | 19.3 KiB | 00m00s [208/303] Installing perl-Archive-Zip-0 100% | 17.1 MiB/s | 297.8 KiB | 00m00s [209/303] Installing perl-Thread-0:3.06 100% | 12.2 MiB/s | 12.5 KiB | 00m00s [210/303] Installing perl-Thread-Queue- 100% | 29.7 MiB/s | 30.4 KiB | 00m00s [211/303] Installing perl-Thread-Semaph 100% | 10.4 MiB/s | 10.6 KiB | 00m00s [212/303] Installing perl-experimental- 100% | 43.8 MiB/s | 44.8 KiB | 00m00s [213/303] Installing perl-Encode-devel- 100% | 7.1 MiB/s | 101.1 KiB | 00m00s [214/303] Installing perl-Pod-Checker-4 100% | 3.7 MiB/s | 53.5 KiB | 00m00s [215/303] Installing perl-diagnostics-0 100% | 23.0 MiB/s | 472.1 KiB | 00m00s [216/303] Installing perl-macros-4:5.42 100% | 5.6 MiB/s | 5.8 KiB | 00m00s [217/303] Installing perl-utils-0:5.42. 100% | 6.0 MiB/s | 98.6 KiB | 00m00s [218/303] Installing perl-Attribute-Han 100% | 39.6 MiB/s | 40.5 KiB | 00m00s [219/303] Installing perl-Config-Extens 100% | 3.2 MiB/s | 3.2 KiB | 00m00s [220/303] Installing perl-Config-Perl-V 100% | 26.9 MiB/s | 27.5 KiB | 00m00s [221/303] Installing perl-Devel-Peek-0: 100% | 43.8 MiB/s | 44.9 KiB | 00m00s [222/303] Installing perl-English-0:1.1 100% | 0.0 B/s | 6.7 KiB | 00m00s [223/303] Installing perl-File-DosGlob- 100% | 21.7 MiB/s | 22.2 KiB | 00m00s [224/303] Installing perl-FileCache-0:1 100% | 0.0 B/s | 7.9 KiB | 00m00s [225/303] Installing perl-FindBin-0:1.5 100% | 0.0 B/s | 7.2 KiB | 00m00s [226/303] Installing perl-GDBM_File-1:1 100% | 78.9 MiB/s | 80.8 KiB | 00m00s [227/303] Installing perl-I18N-Collate- 100% | 0.0 B/s | 7.6 KiB | 00m00s [228/303] Installing perl-I18N-Langinfo 100% | 35.3 MiB/s | 36.2 KiB | 00m00s [229/303] Installing perl-IPC-SysV-0:2. 100% | 37.4 MiB/s | 76.7 KiB | 00m00s [230/303] Installing perl-Module-Loaded 100% | 0.0 B/s | 5.6 KiB | 00m00s [231/303] Installing perl-NEXT-0:0.69-5 100% | 0.0 B/s | 24.0 KiB | 00m00s [232/303] Installing perl-Net-0:1.04-52 100% | 23.3 MiB/s | 23.9 KiB | 00m00s [233/303] Installing perl-ODBM_File-0:1 100% | 29.0 MiB/s | 29.7 KiB | 00m00s [234/303] Installing perl-PerlIO-via-Qu 100% | 31.4 MiB/s | 32.1 KiB | 00m00s [235/303] Installing perl-Pod-Functions 100% | 0.0 B/s | 14.8 KiB | 00m00s [236/303] Installing perl-Search-Dict-0 100% | 0.0 B/s | 5.2 KiB | 00m00s [237/303] Installing perl-Sys-Syslog-0: 100% | 47.3 MiB/s | 96.9 KiB | 00m00s [238/303] Installing perl-Term-Complete 100% | 0.0 B/s | 6.3 KiB | 00m00s [239/303] Installing perl-Test-0:1.31-5 100% | 36.5 MiB/s | 37.4 KiB | 00m00s [240/303] Installing perl-Text-Abbrev-0 100% | 3.5 MiB/s | 3.6 KiB | 00m00s [241/303] Installing perl-Tie-File-0:1. 100% | 84.1 MiB/s | 86.2 KiB | 00m00s [242/303] Installing perl-Tie-Memoize-0 100% | 0.0 B/s | 6.8 KiB | 00m00s [243/303] Installing perl-Time-0:1.04-5 100% | 10.7 MiB/s | 10.9 KiB | 00m00s [244/303] Installing perl-Time-Piece-0: 100% | 71.2 MiB/s | 72.9 KiB | 00m00s [245/303] Installing perl-blib-0:1.07-5 100% | 0.0 B/s | 3.6 KiB | 00m00s [246/303] Installing perl-deprecate-0:0 100% | 6.8 MiB/s | 7.0 KiB | 00m00s [247/303] Installing perl-doc-0:5.42.0- 100% | 235.0 MiB/s | 11.5 MiB | 00m00s [248/303] Installing perl-encoding-warn 100% | 10.4 MiB/s | 10.7 KiB | 00m00s [249/303] Installing perl-filetest-0:1. 100% | 0.0 B/s | 6.8 KiB | 00m00s [250/303] Installing perl-less-0:0.03-5 100% | 5.2 MiB/s | 5.3 KiB | 00m00s [251/303] Installing perl-perlfaq-0:5.2 100% | 180.1 MiB/s | 737.9 KiB | 00m00s [252/303] Installing perl-ph-0:5.42.0-5 100% | 91.7 MiB/s | 281.8 KiB | 00m00s [253/303] Installing perl-sort-0:2.06-5 100% | 5.1 MiB/s | 5.2 KiB | 00m00s [254/303] Installing perl-vmsish-0:1.04 100% | 6.8 MiB/s | 7.0 KiB | 00m00s [255/303] Installing perl-Compress-Bzip 100% | 70.9 MiB/s | 145.3 KiB | 00m00s [256/303] Installing perl-Devel-Size-0: 100% | 42.8 MiB/s | 43.8 KiB | 00m00s [257/303] Installing perl-Text-Glob-0:0 100% | 9.1 MiB/s | 9.3 KiB | 00m00s [258/303] Installing perl-local-lib-0:2 100% | 58.8 MiB/s | 120.4 KiB | 00m00s [259/303] Installing perl-IPC-System-Si 100% | 71.8 MiB/s | 73.5 KiB | 00m00s [260/303] Installing perl-autodie-0:2.3 100% | 71.3 MiB/s | 219.1 KiB | 00m00s [261/303] Installing perl-Compress-Raw- 100% | 60.2 MiB/s | 123.3 KiB | 00m00s [262/303] Installing perl-IO-Compress-L 100% | 71.7 MiB/s | 220.4 KiB | 00m00s [263/303] Installing perl-Algorithm-Dif 100% | 53.5 MiB/s | 109.5 KiB | 00m00s [264/303] Installing perl-Text-Diff-0:1 100% | 41.5 MiB/s | 85.1 KiB | 00m00s [265/303] Installing perl-Archive-Tar-0 100% | 9.6 MiB/s | 156.9 KiB | 00m00s [266/303] Installing perl-Module-Signat 100% | 9.0 MiB/s | 138.8 KiB | 00m00s [267/303] Installing perl-Text-Template 100% | 55.7 MiB/s | 114.0 KiB | 00m00s [268/303] Installing perl-MRO-Compat-0: 100% | 43.8 MiB/s | 44.9 KiB | 00m00s [269/303] Installing perl-Package-Gener 100% | 30.8 MiB/s | 31.5 KiB | 00m00s [270/303] Installing perl-Sub-Exporter- 100% | 49.3 MiB/s | 201.9 KiB | 00m00s [271/303] Installing perl-Data-Section- 100% | 43.0 MiB/s | 44.1 KiB | 00m00s [272/303] Installing perl-Software-Lice 100% | 83.5 MiB/s | 513.2 KiB | 00m00s [273/303] Installing perl-Module-Build- 100% | 32.4 MiB/s | 663.2 KiB | 00m00s [274/303] Installing perl-TermReadKey-0 100% | 32.3 MiB/s | 66.2 KiB | 00m00s [275/303] Installing perl-Error-1:0.170 100% | 39.0 MiB/s | 80.0 KiB | 00m00s [276/303] Installing git-0:2.50.1-2.fc4 100% | 85.2 MiB/s | 87.2 KiB | 00m00s [277/303] Installing perl-Git-0:2.50.1- 100% | 63.5 MiB/s | 65.0 KiB | 00m00s [278/303] Installing rocm-clang-0:19-15 100% | 67.8 MiB/s | 70.2 MiB | 00m01s [279/303] Installing rocm-clang-devel-0 100% | 89.5 MiB/s | 23.5 MiB | 00m00s [280/303] Installing rocm-device-libs-0 100% | 76.5 MiB/s | 3.2 MiB | 00m00s [281/303] Installing rocm-comgr-devel-0 100% | 48.6 MiB/s | 99.6 KiB | 00m00s [282/303] Installing hipcc-0:19-15.rocm 100% | 27.8 MiB/s | 654.3 KiB | 00m00s [283/303] Installing rocm-hip-0:6.4.2-2 100% | 249.5 MiB/s | 24.9 MiB | 00m00s [284/303] Installing libdb-0:5.3.28-66. 100% | 207.0 MiB/s | 1.9 MiB | 00m00s [285/303] Installing perl-DB_File-0:1.8 100% | 46.5 MiB/s | 190.6 KiB | 00m00s [286/303] Installing libstdc++-devel-0: 100% | 275.6 MiB/s | 37.5 MiB | 00m00s [287/303] Installing gcc-c++-0:15.2.1-1 100% | 255.3 MiB/s | 41.4 MiB | 00m00s [288/303] Installing perl-ExtUtils-CBui 100% | 33.2 MiB/s | 102.1 KiB | 00m00s [289/303] Installing perl-CPAN-0:2.38-5 100% | 75.8 MiB/s | 1.9 MiB | 00m00s [290/303] Installing perl-4:5.42.0-520. 100% | 121.1 KiB/s | 124.0 B | 00m00s [291/303] Installing rhash-0:1.4.5-3.fc 100% | 20.5 MiB/s | 356.4 KiB | 00m00s [292/303] Installing libuv-1:1.51.0-2.f 100% | 186.5 MiB/s | 573.0 KiB | 00m00s [293/303] Installing jsoncpp-0:1.9.6-2. 100% | 126.5 MiB/s | 259.2 KiB | 00m00s [294/303] Installing cmake-0:3.31.6-4.f 100% | 230.0 MiB/s | 34.5 MiB | 00m00s [295/303] Installing cmake-data-0:3.31. 100% | 50.1 MiB/s | 9.1 MiB | 00m00s [296/303] Installing rocm-cmake-0:6.4.0 100% | 66.2 MiB/s | 135.6 KiB | 00m00s [297/303] Installing hipify-0:6.4.1-3.f 100% | 134.2 MiB/s | 3.1 MiB | 00m00s [298/303] Installing rocm-hip-devel-0:6 100% | 110.9 MiB/s | 2.8 MiB | 00m00s [299/303] Installing rocm-rpm-macros-0: 100% | 19.0 MiB/s | 19.5 KiB | 00m00s [300/303] Installing rocm-smi-devel-0:6 100% | 138.7 MiB/s | 284.0 KiB | 00m00s [301/303] Installing rocm-core-devel-0: 100% | 15.8 MiB/s | 16.1 KiB | 00m00s [302/303] Installing annobin-plugin-gcc 100% | 35.3 MiB/s | 1.0 MiB | 00m00s [303/303] Installing gcc-plugin-annobin 100% | 196.1 KiB/s | 58.6 KiB | 00m00s Warning: skipped OpenPGP checks for 31 packages from repository: copr_base Complete! Finish: build setup for rccl-6.4.2-3.fc43.src.rpm Start: rpmbuild rccl-6.4.2-3.fc43.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1753747200 Executing(%mkbuilddir): /bin/sh -e /var/tmp/rpm-tmp.h6RcDl Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.LAMkQl + umask 022 + cd /builddir/build/BUILD/rccl-6.4.2-build + cd /builddir/build/BUILD/rccl-6.4.2-build + rm -rf rccl-rocm-6.4.2 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/RCCL-6.4.2.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd rccl-rocm-6.4.2 + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e '/AMD GPU targets to compile for/d' CMakeLists.txt + sed -i -e 's@cat ${ROCM_PATH}/.info/version@echo 6.4.2@' CMakeLists.txt + sed -i -e s@rocm-core/rocm_version.h@rocm_version.h@ src/include/hip_rocm_version_info.h + sed -i -e 's@if (ENABLE_MSCCLPP AND NOT(${HOST_OS_ID} STREQUAL "ubuntu" OR ${HOST_OS_ID} STREQUAL "centos"))@if (ENABLE_MSCCLPP)@' CMakeLists.txt + sed -i '/#include ' test/common/TestBed.hpp + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.y8j1Bg + umask 022 + cd /builddir/build/BUILD/rccl-6.4.2-build + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.4.2 + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DCMAKE_INSTALL_FULL_SBINDIR:PATH=/usr/bin -DCMAKE_INSTALL_SBINDIR:PATH=bin -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON '-DAMDGPU_TARGETS=gfx90a:xnack+;gfx90a:xnack-;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201' -DBUILD_FILE_REORG_BACKWARD_COMPATIBILITY=OFF -DBUILD_TESTS=OFF -DCMAKE_BUILD_TYPE=RelWithDebInfo -DCMAKE_C_COMPILER=/usr/bin/hipcc -DCMAKE_CXX_COMPILER=/usr/bin/hipcc -DCMAKE_EXPORT_COMPILE_COMMANDS=OFF -DCMAKE_INSTALL_LIBDIR=/usr/lib64 -DCMAKE_SKIP_RPATH=ON -DENABLE_MSCCLPP=OFF -DHIP_PLATFORM=amd -DRCCL_ROCPROFILER_REGISTER=OFF -DROCM_PATH=/usr -DROCM_SYMLINK_LIBS=OFF CMake Deprecation Warning at CMakeLists.txt:6 (cmake_minimum_required): Compatibility with CMake < 3.10 will be removed from a future version of CMake. Update the VERSION argument value. Or, use the ... syntax to tell CMake that the project requires at least but has been updated to work with policies introduced by or earlier. -- CMAKE_TOOLCHAIN_FILE: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/toolchain-linux.cmake -- The CXX compiler identification is Clang 19.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") CMake Deprecation Warning at /usr/share/rocm/cmake/ROCMConfig.cmake:12 (message): Use of find_package(ROCM) is deprecated as of ROCm 6.4. Please use find_package(ROCmCMakeBuildTools) Call Stack (most recent call first): cmake/Dependencies.cmake:75 (find_package) CMakeLists.txt:55 (include) -- Checking for ROCm support for GPU targets: gfx906;gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx906 -- Performing Test COMPILER_HAS_TARGET_ID_gfx906 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx908 -- Performing Test COMPILER_HAS_TARGET_ID_gfx908 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 - Success -- Compiling for gfx906;gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") CMake Deprecation Warning at /usr/share/rocm/cmake/ROCMConfig.cmake:12 (message): Use of find_package(ROCM) is deprecated as of ROCm 6.4. Please use find_package(ROCmCMakeBuildTools) Call Stack (most recent call first): cmake/Dependencies.cmake:75 (find_package) CMakeLists.txt:102 (include) -- ROCM_PATH found: /usr -- Compiling with hipcc -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- hipcc version: 6.4.43484 -- hipconfig executable: /usr/bin/hipconfig -- hipcc HIP version: 6.4.43484 -- ROCm version: 6.4.2 -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - found -- Looking for hipDeviceMallocContiguous -- Looking for hipDeviceMallocContiguous - found -- RCCL LL128 protocol enabled -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- RSMI_INIT_FLAG_THRAD_ONLY_MUTEX supported -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Performing Test HAVE_PARALLEL_JOBS -- Performing Test HAVE_PARALLEL_JOBS - Success -- Parallel jobs enabled CMake Warning at CMakeLists.txt:331 (message): ROCTX library not found. Skipping ROCTX linking. -- Found Python3: /usr/bin/python3.14 (found version "3.14.0") found components: Interpreter -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.h -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp -- HIP_CONTIGUOUS_MEMORY enabled -- HIP_UNCACHED_MEMORY enabled -- Use 1 jobs for linking -- Building shared RCCL library -- rocm-cmake: Set license file to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/LICENSE.txt. -- Configuring done (48.2s) -- Generating done (0.1s) CMake Warning: Manually-specified variables were not used by the project: AMDGPU_TARGETS CMAKE_CXX_FLAGS_RELEASE CMAKE_C_FLAGS_RELEASE CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j2 --verbose Change Dir: '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j2 /usr/bin/cmake -S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2 -B/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/CMakeFiles /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' cd /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' [ 0%] Built target git_version_check /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' [ 0%] Hipifying src/transport/shm.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc [ 0%] Hipifying src/bootstrap.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/shm.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/bootstrap.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc [ 0%] Hipifying src/channel.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/channel.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc [ 0%] Hipifying src/collectives.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/collectives.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc [ 1%] Hipifying src/debug.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/debug.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc [ 1%] Hipifying src/device/all_gather.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/all_gather.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h [ 2%] Hipifying src/device/all_reduce.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/all_reduce.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h [ 2%] Hipifying src/device/alltoall_pivot.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/alltoall_pivot.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h [ 2%] Hipifying src/device/broadcast.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/broadcast.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h [ 2%] Hipifying src/device/common.cu -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/common.cu -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h [ 2%] Hipifying src/device/common.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/common.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h [ 3%] Hipifying src/device/common_kernel.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/common_kernel.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common_kernel.h [ 4%] Hipifying src/device/msccl_kernel_impl.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/msccl_kernel_impl.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h [ 4%] Hipifying src/device/network/unpack/unpack.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/network/unpack/unpack.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack.h [ 5%] Hipifying src/device/network/unpack/unpack_defs.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/network/unpack/unpack_defs.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 5%] Hipifying src/device/onerank.cu -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/onerank.cu -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 5%] Hipifying src/device/op128.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/op128.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/op128.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/op128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/op128.h [ 5%] Hipifying src/device/primitives.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/primitives.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/op128.h [ 6%] Hipifying src/device/prims_ll.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/prims_ll.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h [ 6%] Hipifying src/device/prims_ll128.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/prims_ll128.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h [ 6%] Hipifying src/device/prims_simple.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/prims_simple.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h [ 6%] Hipifying src/device/reduce.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/reduce.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h [ 6%] Hipifying src/device/reduce_kernel.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/reduce_kernel.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h [ 7%] Hipifying src/device/reduce_scatter.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/reduce_scatter.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_kernel.h [ 7%] Hipifying src/device/sendrecv.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/device/sendrecv.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h [ 7%] Hipifying src/enqueue.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/enqueue.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h [ 7%] Hipifying src/graph/connect.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/connect.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc [ 7%] Hipifying src/graph/paths.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/paths.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc [ 8%] Hipifying src/graph/rings.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/rings.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc [ 8%] Hipifying src/graph/rings.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/rings.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.h [ 8%] Hipifying src/graph/rome_models.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/rome_models.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc [ 8%] Hipifying src/graph/rome_models.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/rome_models.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.h [ 9%] Hipifying src/graph/search.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/search.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc [ 9%] Hipifying src/graph/topo.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/topo.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc [ 9%] Hipifying src/graph/topo.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/topo.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h [ 9%] Hipifying src/graph/trees.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/trees.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/trees.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/trees.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/trees.cc [ 10%] Hipifying src/graph/tuning.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/tuning.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc [ 10%] Hipifying src/graph/xml.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/xml.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc [ 10%] Hipifying src/graph/xml.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/graph/xml.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h [ 10%] Hipifying src/group.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/group.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc [ 10%] Hipifying src/include/BfdBacktrace.hpp -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/BfdBacktrace.hpp -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp [ 10%] Hipifying src/include/alloc.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/alloc.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h [ 11%] Hipifying src/include/alt_rsmi.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alt_rsmi.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/alt_rsmi.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alt_rsmi.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alt_rsmi.h [ 11%] Hipifying src/include/api_trace.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/api_trace.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/api_trace.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/api_trace.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/api_trace.h [ 11%] Hipifying src/include/archinfo.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/archinfo.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/archinfo.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/archinfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/archinfo.h [ 11%] Hipifying src/include/argcheck.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/argcheck.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h [ 12%] Hipifying src/include/bitops.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bitops.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/bitops.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bitops.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bitops.h [ 12%] Hipifying src/include/bootstrap.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/bootstrap.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h [ 12%] Hipifying src/include/channel.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/channel.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h [ 12%] Hipifying src/include/checks.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/checks.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/checks.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/checks.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/checks.h [ 12%] Hipifying src/include/coll_net.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/coll_net.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h [ 13%] Hipifying src/include/collectives.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/collectives.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h [ 13%] Hipifying src/include/comm.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/comm.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h [ 13%] Hipifying src/include/core.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/core.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h [ 14%] Hipifying src/include/cpuset.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/cpuset.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/cpuset.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/cpuset.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/cpuset.h [ 14%] Hipifying src/include/debug.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/debug.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/debug.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/debug.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/debug.h [ 14%] Hipifying src/include/device.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/device.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h [ 14%] Hipifying src/include/enqueue.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/enqueue.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h [ 15%] Hipifying src/include/gdrwrap.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/gdrwrap.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h [ 15%] Hipifying src/include/git_version.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/git_version.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/git_version.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/git_version.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/git_version.h [ 15%] Hipifying src/include/graph.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/graph.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h [ 15%] Hipifying src/include/group.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/group.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h [ 16%] Hipifying src/include/hip_rocm_version_info.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/hip_rocm_version_info.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h [ 16%] Hipifying src/include/ibvcore.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvcore.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/ibvcore.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvcore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvcore.h [ 16%] Hipifying src/include/ibvsymbols.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvsymbols.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/ibvsymbols.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvsymbols.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvsymbols.h [ 16%] Hipifying src/include/ibvwrap.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/ibvwrap.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h [ 17%] Hipifying src/include/info.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/info.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h [ 17%] Hipifying src/include/ipcsocket.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ipcsocket.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/ipcsocket.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ipcsocket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ipcsocket.h [ 17%] Hipifying src/include/msccl/msccl_kernel.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_kernel.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h [ 17%] Hipifying src/include/msccl/msccl_lifecycle.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_lifecycle.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h [ 17%] Hipifying src/include/msccl/msccl_parser.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_parser.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h [ 17%] Hipifying src/include/msccl/msccl_scheduler.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_scheduler.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h [ 18%] Hipifying src/include/msccl/msccl_setup.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_setup.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h [ 18%] Hipifying src/include/msccl/msccl_status.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_status.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h [ 18%] Hipifying src/include/msccl/msccl_struct.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/msccl/msccl_struct.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h [ 18%] Hipifying src/include/nccl_common.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_common.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nccl_common.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_common.h [ 18%] Hipifying src/include/nccl_net.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_net.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nccl_net.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_net.h [ 19%] Hipifying src/include/nccl_tuner.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_tuner.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nccl_tuner.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nccl_tuner.h [ 19%] Hipifying src/include/net.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/net.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h [ 19%] Hipifying src/include/net_device.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net_device.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/net_device.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net_device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net_device.h [ 19%] Hipifying src/include/npkit/npkit.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/npkit/npkit.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h [ 20%] Hipifying src/include/npkit/npkit_event.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit_event.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/npkit/npkit_event.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit_event.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit_event.h [ 20%] Hipifying src/include/npkit/npkit_struct.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/npkit/npkit_struct.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h [ 20%] Hipifying src/include/nvmlwrap.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvmlwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvmlwrap.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvmlwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvmlwrap.h [ 21%] Hipifying src/include/nvtx.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h [ 21%] Hipifying src/include/nvtx3/nvToolsExt.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExt.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h [ 21%] Hipifying src/include/nvtx3/nvToolsExtCounters.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtCounters.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtCuda.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtCudaRt.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtMem.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtMem.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtMemCudaRt.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtMemCudaRt.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtOpenCL.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtPayload.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayloadHelper.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtPayloadHelper.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtSemanticsCounters.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtSemanticsCounters.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtSemanticsScope.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtSemanticsScope.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvToolsExtSync.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtx3.hpp -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp [ 24%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImpl.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtInit.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtInit.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtTypes.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h [ 29%] Hipifying src/include/nvtx_stub.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx_stub.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/nvtx_stub.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx_stub.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx_stub.h [ 29%] Hipifying src/include/p2p.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/p2p.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h [ 29%] Hipifying src/include/param.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/param.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/param.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/param.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/param.h [ 30%] Hipifying src/include/profiler.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/profiler.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h [ 30%] Hipifying src/include/proxy.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/proxy.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h [ 30%] Hipifying src/include/rccl_common.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/rccl_common.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h [ 30%] Hipifying src/include/rccl_float8.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/rccl_float8.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h [ 30%] Hipifying src/include/rccl_vars.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_vars.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/rccl_vars.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_vars.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_vars.h [ 31%] Hipifying src/include/register.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/register.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/register.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/register.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/register.h [ 31%] Hipifying src/include/rocm_smi_wrap.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/rocm_smi_wrap.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h [ 31%] Hipifying src/include/rocmwrap.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rocmwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/rocmwrap.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rocmwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rocmwrap.h [ 32%] Hipifying src/include/roctx.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/roctx.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h [ 32%] Hipifying src/include/shm.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/shm.h [ 32%] Hipifying src/include/signals.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/signals.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/shm.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/shm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/shm.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/signals.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/signals.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/signals.h [ 32%] Hipifying src/include/socket.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/socket.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/socket.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/socket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/socket.h [ 33%] Hipifying src/include/strongstream.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/strongstream.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/strongstream.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/strongstream.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/strongstream.h [ 33%] Hipifying src/include/timer.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/timer.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/timer.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/timer.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/timer.h [ 33%] Hipifying src/include/transport.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/transport.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/transport.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/transport.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/transport.h [ 33%] Hipifying src/include/trees.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/trees.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/trees.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/trees.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/trees.h [ 34%] Hipifying src/include/tuner.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/tuner.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h [ 34%] Hipifying src/include/utils.h -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/include/utils.h -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h [ 34%] Hipifying src/init.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/init.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc [ 35%] Hipifying src/init_nvtx.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/init_nvtx.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc [ 36%] Hipifying src/misc/alt_rsmi.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/alt_rsmi.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc [ 36%] Hipifying src/misc/api_trace.c -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.c mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/api_trace.c -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.c && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.c [ 37%] Hipifying src/misc/api_trace.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/api_trace.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc [ 37%] Hipifying src/misc/archinfo.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/archinfo.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/archinfo.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/archinfo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/archinfo.cc [ 37%] Hipifying src/misc/argcheck.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/argcheck.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc [ 37%] Hipifying src/misc/ibvsymbols.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/ibvsymbols.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc [ 37%] Hipifying src/misc/ibvwrap.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/ibvwrap.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc [ 37%] Hipifying src/misc/ipcsocket.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/ipcsocket.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc [ 37%] Hipifying src/misc/msccl/msccl_parser.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/msccl/msccl_parser.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc [ 38%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/msccl/msccl_lifecycle.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc [ 38%] Hipifying src/misc/msccl/msccl_setup.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/msccl/msccl_setup.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc [ 38%] Hipifying src/misc/msccl/msccl_status.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/msccl/msccl_status.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc [ 38%] Hipifying src/misc/npkit.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/npkit.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc [ 39%] Hipifying src/misc/nvmlwrap_stub.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/nvmlwrap_stub.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc [ 39%] Hipifying src/misc/param.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/param.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/param.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/param.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/param.cc [ 39%] Hipifying src/misc/profiler.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/profiler.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc [ 39%] Hipifying src/misc/rocm_smi_wrap.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/rocm_smi_wrap.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc [ 40%] Hipifying src/misc/rocmwrap.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocmwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/rocmwrap.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocmwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocmwrap.cc [ 40%] Hipifying src/misc/roctx.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/roctx.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc [ 40%] Hipifying src/misc/shmutils.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/shmutils.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc [ 40%] Hipifying src/misc/signals.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/signals.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/signals.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/signals.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/signals.cc [ 41%] Hipifying src/misc/socket.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/socket.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc [ 41%] Hipifying src/misc/strongstream.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/strongstream.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/strongstream.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/strongstream.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/strongstream.cc [ 41%] Hipifying src/misc/tuner.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/tuner.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc [ 41%] Hipifying src/misc/utils.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/misc/utils.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc [ 41%] Hipifying src/msccl.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/msccl.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc [ 41%] Hipifying src/net.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/net.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc [ 41%] Hipifying src/proxy.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/proxy.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc [ 42%] Hipifying src/rccl_wrap.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/rccl_wrap.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc [ 42%] Hipifying src/register.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/register.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc [ 42%] Hipifying src/transport.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc [ 43%] Hipifying src/transport/coll_net.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/coll_net.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc [ 43%] Hipifying src/transport/generic.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/generic.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc [ 43%] Hipifying src/transport/net_ib.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/net_ib.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc [ 44%] Hipifying src/transport/net_socket.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/net_socket.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc [ 44%] Hipifying src/transport/net.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/net.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc [ 44%] Hipifying src/transport/nvls.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/nvls.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc [ 44%] Hipifying src/transport/p2p.cc -> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/src/transport/p2p.cc -o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc cd /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8In file included from warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1201. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1200. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for 8gfx942 warnings generated when compiling for gfx1100. . 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for host. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ :93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ :378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable]:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamIn file included from sSendRecv payload{count */builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISp ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cceed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ :486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ ; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ PayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * nc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), rclTypeSize(datatype), datatype}; | ^~~~~~~ oot, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cce:S7c: atIn file included from t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.he:r38 : pIn file included from ayl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.ho:a14d: {r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.he:c44v:c13o:u ntwarning: *unused function 'log2i' [-Wunused-function] ncclTypeSize(data t44y | pset)a,t iocp ,l odnagt altoygp2ei}(;l on g| ^~~~~~~n ) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ atatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 31 warnings generated when compiling for gfx90a. 1 warning generated when compiling for host. 31 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. 31 warnings generated when compiling for gfx1102. 31 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx906. 31 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. 31 warnings generated when compiling for gfx1030. 31 warnings generated when compiling for host. 1 warning generated when compiling for gfx1030. 31 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1200. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** co 18 | static ncclRellComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return nccsult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm)lSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14 { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const : cwarning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | hstatic faloat ncrclTopoNV*LinkBw( int cudgcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, siaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ ze_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ ECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* compomrted) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t fun35 warnings generated when compiling for gfx1030. c, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 35 warnings generated when compiling for gfx1201. 35 warnings generated when compiling for gfx1101. 35 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 35 warnings generated when compiling for gfx1102. 35 warnings generated when compiling for gfx1200. 35 warnings generated when compiling for gfx908. 35 warnings generated when compiling for gfx942. 35 warnings generated when compiling for gfx90a. 35 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 35 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ :2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ :2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.ccpayload{myrank, nranks, cudaDev}; | ^~~~~~~ :1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ :2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSup/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ port(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuIn file included from ccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return nccl 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21:Success; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t off swarning: etunused function 'ncclTopoRankToIndex' [-Wunused-function], int fd, void** m h226a | nsdtlaet)i c{ nNcCcClLRCeHsEuClKt(_cto mnmc-c>lnTcocploCRoalnlkNTeotI-n>dreexg(MsrtDrmuacBtu fn(cccollTloCpoomSmy,s tdeamt*a ,s yssitzeem,, tiynpte ,r aonfkf,s eitn,t *f di,n dmehxa)n d{l e )| ) ^~~~~~~~~~~~~~~~~~~; return ncc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hl:S237u:c21c:e swarning: sunused function 'ncclTopoDevToRank' [-Wunused-function]; } | ^~~~~~~~~~~~~~~~~~ 237 | st/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.ha:t25i:c21 :n cwarning: cunused function 'collNetDeregMr' [-Wunused-function]l Result_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRastatic ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, conk(struct ncclTopoSystem* system,n sti char* attrName, floant dev, int* rank) { t* val | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] ue262) | s{t a t| i ^~~~~~~~~~~~~~~c float ncclT/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.ho:p140o:X21G:M Iwarning: Sunused function 'xmlFindTag' [-Wunused-function]p eed(const ch a140r | *s tgactni)c {n c c| ^~~~~~~~~~~~~~~~~l Resu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hl:t272_:t14 :x mwarning: lunused function 'ncclTopoNVLinkBw' [-Wunused-function]F indTag 272( | ssttruactti cn cfclloXamtl *n cxcmllT,o pcooNnVsLti nckhBawr(*i ntta gcNuadmaeC,o msptCraupc)t {n c c| l ^~~~~~~~~~~~~~~~ XmlN/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.ho:de283*:*13 :n owarning: dunused function 'isPow2' [-Wunused-function]e ) { | 283 ^~~~~~~~~~ | sta/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.ht:i152c: 21b:o owarning: lunused function 'xmlFindNextTag' [-Wunused-function] isPow2(int v a152l | )s t{a t i| c ^~~~~~ ncc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hl:R286e:s12u:l twarning: _unused function 'mirrorBits' [-Wunused-function]t xmlF i286nd | NsetxattTiacg (isnttr umcitr rnocrcBliXtmsl(*i nxtm lv,a lc,o nisntt cphoawr2*) t{a g N| ^~~~~~~~~~a me, struct nccIn file included from l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.ccX:m40l: N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.ho:d76e:*21 :p rwarning: eunused function 'xmlAlloc' [-Wunused-function]v , struct ncc l76X | msltNaotdiec* *n cncoldRee)s u{l t _| t ^~~~~~~~~~~~~~ xml/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hA:l164l:o21c:( swarning: tunused function 'xmlFindTagKv' [-Wunused-function]r uct ncclXml** 164x | mslt,a tiinct nmcacxlNRoedseusl)t _{t x| m ^~~~~~~~ lFind/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hT:a111g:K21v:( swarning: tunused function 'xmlGetAttrInt' [-Wunused-function]r uct ncclXm l111* | sxtmalt,i cc onncsctl Rcehsaurl*t _tta gxNmalmGee,t AstttrruIcntt (nsctcrlucXmtl Nnocdcel*X*m lnNooddee, *c onnosdte ,c hcaorn*s ta tcthraNra*m ea,t tcroNnasmte ,c hianrt** avtatlruVea)l u{e ) | ^~~~~~~~~~~~~{ | ^~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h :118:21/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:: 180warning: :unused function 'xmlGetAttrIntDefault' [-Wunused-function]21: warning: unused function 'xmlFindNode' [-Wunused-function] 118 | s t180a | tsitca tniccc lnRcecsluRlets_utl tx_mlt GxemtlAFtitnrIdnNtoDdeef(asutlrtu(cstt rnucclXmlNode* parentNode,c struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | stat ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:tic ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSe133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const ch/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cca:r2563*: 26a:t twarning: runused variable 'payload' [-Wunused-variable]N ame, const ch ar2563* | v aNlvutex)P a{r a m| s ^~~~~~~~~~~~~~~~~C omm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hI:n228i:t21R:a nwarning: kunused function 'xmlSetAttrInt' [-Wunused-function] payload{r a228n | ks,t antriacn knsc,c lcRuedsauDletv_}t; x m| l ^~~~~~~S etAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | stattAitct rnIcfcUlnRseestu(lstt_rtu cxtm lnScectlAXtmtlrNLoodneg*( sntorduec,t cnocncsltX mclhNaord*e *a tntordNea,m ec,o ncsotn scth acrh*a ra*t tvraNlaumee), {c o n| s ^~~~~~~~~~~~~~~~~t int64_t value) {/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h :| 228 ^~~~~~~~~~~~~~: 21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hwarning: :unused function 'xmlSetAttrInt' [-Wunused-function]267 :21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 228 | s267t | asttiact incc cnlResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ cclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, v/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ oid* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx1101. 57 warnings generated when compiling for gfx1200. 57 warnings generated when compiling for gfx90a. 57 warnings generated when compiling for gfx1201. 57 warnings generated when compiling for gfx906. 57 warnings generated when compiling for gfx908. 57 warnings generated when compiling for gfx1030. 57 warnings generated when compiling for gfx1102. 57 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ edSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ :4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx942. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc 57 warnings generated when compiling for host. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/msccl.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h10: In file included from :15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: :unused variable 'y' [-Wunused-variable] 77 | 14 uin: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from t32_t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h y, h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | ead, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | :54N:38: warning: unused variable 'MscclSchema' [-Wunused-variable] v54 | constexpr tnvtxPayloaxdSchemaEntPry_t MscclSachema[] = r{ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:a61:19: warning: unused variable 'payload' [-Wunused-variable]msMsccl payl 61o | NvtxParad{count * ncclTypeSize(dataamsTMsccl payloyad{count p* nccle), TypeSize(dop, dataType}; | ^~~~~~~ataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(lo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.hn:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: gIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:n13: warning: unused function 'log2i' [-Wunused-function] 44) | static l ong log2i({lo n | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.ccg n) :{ | ^~~~~ In file included from 7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclRe/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:s21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | staticu ncclResullt_t t_tmscclXm msclclXmlGGetAttrInt(etAttrsInt(structruct mscclXmlNot mscclXmlNode* node, const chde* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t msccar*l aXttrNamme, intl* valuGe) { e| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | tstatic AncclRestult_t tmscclXmrlGetIAttrIntn64(strtuct64(struct mscclX msccmlXmllNode* nNodode* node, coe, connst chasr* t catthar* attrrNameN, int6ame, int64_t*4 _t* vavlue) { a | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] lue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct ms 89 | cstaticc nccllResultX_t ml* xml, const char* tmascclXmglFindTName, struagc(strtuct ms cclXmml* xml,s const cchar* ctagNamel, struXct mscmclXmlNoldNode*e** *node) {node) { | ^~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static * value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 7 warnings generated when compiling for gfx1200. 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for gfx1030. 77 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/rccl_wrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/rccl_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/rccl_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/rccl_wrap.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ a; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ :289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1200. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:25: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21:In file included from warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: */builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215: 21: warning: nunused function 'ncclTopoIdToIndex' [-Wunused-function] e215 | sttaticD nccleResuvlt_t) ncc lTop{oIdT oInd e| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hx(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(str:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] u286ct | nccslTotpoSaysttem*i sycste m, iintn ratnk, imnit* rindrex)o { r | ^~~~~~~~~~~~~~~~~~~B /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hi:237:21: warning: tunused function 'ncclTopoDevToRank' [-Wunused-function] s237 | s(tatiic nncclResutlt_t ncvclTaopolDev,T int pow2) { | ^~~~~~~~~~ oRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTIn file included from o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ poRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/rccl_wrap.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 9 warnings generated when compiling for gfx906. 3 warnings generated when compiling for host. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/register.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/register.cc.o -MF CMakeFiles/rccl.dir/hipify/src/register.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/register.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for gfx1030. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantis/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ sa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14 | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 22 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx1102. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport.cc:2 warnings generated when compiling for gfx90a. 8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1102. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y,: 10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long2 warnings generated when compiling for host. n) { | ^~~~~ 2 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ issa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static{ | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvls float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mHeads, int nHeads) { | ^~~~~~~~~~~ irrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t iIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ d, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagNa/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ me, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.ccdefaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrNam:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ e) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char*unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) {/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXml | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaComNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xmpCap)l { | ^~~~~~~~~~~~~~~~, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h: 283:13: cwarning: unused function 'isPow2' [-Wunused-function] 283o | static bool isPow2(nint vasl) { | t ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286: 12: warning: unused function 'mirrorBits' [-Wunused-function]c 286 | shtatic iant mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(str* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searcrhuct ncNclXmlNoode* nodde, coenst ch,ar* att rName,s const tint varlue) { u | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hc:241:21: warning: tunused function 'xmlSetAttrFloat' [-Wunused-function] 241 | s tatic ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, consncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static nt floact value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResuclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) {lt_ t xmlUnsetA ttr| (st ^~~~~~~~~~~~~~~~~ruc t n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hcclXml:Nod228e* node, :con21st :cha r* warning: attunused function 'xmlSetAttrInt' [-Wunused-function]rNa me) { 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNonode, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const floatde **v suab) l{ u| ^~~~~~~~~ e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305):21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function]{ 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int a | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.httrVa:lue267) {: | 21 ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h::312: 21: warning: warning: unused function 'xmlAddNode' [-Wunused-function] unused function 'xmlUnsetAttr' [-Wunused-function] 312 | static 267ncc | lRsesultt_ta xmtlAdidNocde( strnuct cnccclXmll*R xmel, sstruuctl nctclX_mlNtode * paxrenmt, lcoUnsetAttrnst char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] (struct ncclXmlNode* node, const char* attrName) { | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, c ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct 334 | nstatcicc nlcclXResmultl_t xNmolRdemoveeNo* node, const char* subName,d e(sstrtucrt uncccltXm lNnodce*c nlodXe)m { l | N ^~~~~~~~~~~~~ o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hd:347e:21:* warning: *'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] s347 | ustabti)c nc{cl Result_t xmlAddTree(struct ncclXml* ds t| ^~~~~~~~~, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hs:305t:21r: uwarning: cunused function 'xmlGetSubKvInt' [-Wunused-function]t 305n | scclXmlNode* parent, strounst char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* valuct nccltatic ncclResult_t xmlGetSubKvInt(struct ncclXmlNe) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] ode* node, const char* subName, struct ncclXmlNode**XmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 31 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for gfx908. 31 warnings generated when compiling for gfx1030. 31 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx942. 31 warnings generated when compiling for gfx906. 31 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx1102. 31 warnings generated when compiling for gfx1100. 31 warnings generated when compiling for gfx90a. 31 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx906. 11 warning generated when compiling for gfx90a. warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa ; uint32_t y, head, mantis sa; | ^| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc::1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 13411341 | int nC:hannels = 70; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ : warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036::7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | 2036 int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ :1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316l, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ :9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode*nc*clXmlNode** sub) { | s ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21u: warning: unused function 'xmlRemoveNode' [-Wunused-function] b334 | static n,cclResul t_t xmlcRemoveoNode(stnruct nscclXmlNtode* nod char* attrName, const int e) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, structncclXm l* dst,n structc ncclXmlcNode* plarent, Xstruct mncclXmlNlode* srNcNode) {o | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hd:377:21:e warning: unused function 'kvConvertToInt' [-Wunused-function] 377* | static *ncclResu lt_t kvsConvertToub)Int(const char* str, int* value, struct kv { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.hDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ :347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* suIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ kBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xbName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ mlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct nccIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ lTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResulIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmt_t xmlFindTag(struct In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ lNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNodeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncc* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ lXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 38 warnings generated when compiling for gfx942. 38 warnings generated when compiling for gfx1200. 38 warnings generated when compiling for gfx1101. 38 warnings generated when compiling for gfx1030. 38 warnings generated when compiling for gfx90a. 38 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, c38onst char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, con warningss generated when compiling for tgfx908. char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 38 warnings generated when compiling for gfx906. 38 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: 38 warnings generated when compiling for gfx1201. warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ clResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ 18 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 38 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc 18 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/trees.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* commset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, , void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const chint64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ar* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], in/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResut nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | stlt_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdTatic ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return oNetDev(struct ncclncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | statLinkBw(int cudaComipCap) { c ess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const i| ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15n: t64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: ounused function 'collNetName' [-Wunused-function] 16 | sntatg logi2c consti c(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21:har* collNetwarning: Name(stunused function 'ncclTopoDevToRank' [-Wunused-function]ruct nc clCom m* co237 | static ncclmm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] Result_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h19 | static: nccl249Resul:t_t c21ollNe:tList en(stwarning: ructunused function 'ncclTopoIdToNetDev' [-Wunused-function] ncc lCo 249 | static ncclResult_t ncclTopoIdToNetDev(strucmm*t comm , intn dev, cvoidc* hanldle, Tvoid*o* lisptenComom) { SNCCLCyHECKstem(comm->ncclCollNet->listen(dev, handle, listenComm))* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* co; rleturnl ncclNSucceesst; } N| ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, voiame(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResuld** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSt_ut colclNetRcess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.hedu:ceS20upp:ort21(st:ruc t nwarning: ccunused function 'collNetConnect' [-Wunused-function]lCom m* comm, ncc20lDa | taTsypet_t adattaTyipec, n cclnRedcOpc_t rledORp,e isult_t collNetConnect(strunt* suIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* cpported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, ct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rerdOp,a sunppokrte,d)) ; rleturn ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandleomm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] istenComm, collComm)); return ncclSuccess; } | 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSuppo)r); tret(urns nctclSurcceuss;c } t | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h :n25:21:c warning: unused function 'collNetDeregMr' [-Wunused-function]c 25 | stlatiCc nocclmResmult*_t colclNeotDemremgMr,(st rucnt cclDataType_t dataType, ncclRedOncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet-p_t redOp, int* supported) { NCCLC:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* aHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static> dernecgMrc(colllCRoemms, muhanldlet));_ tret urnc noccllSulcceNss; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* senttrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrdData, void* reValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ etRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData,cvData, int count, ncclDataType_t dataType, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollN6e4_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* part->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.hent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ :31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:CHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); retu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode*22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int f* node) { | ^~~~~~~~~~~~~~ d, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, srn ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] truct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | st->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(atic ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, costruct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] nst char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xm 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ lSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenCIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclomm) { NCCLCHECK(comm->ncclCollNet->listen(dev, hanResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXdle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function]mlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrN 20 | static ncclResult_t collNetConnect(struct ame) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRECK(comm->ncclCoank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, intllNet->connect(handles, nranks, rank, listenComm, dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXmHECK(comm->ncclCollNet->reducel* xml, const char* tagName, struct nccSupportlXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFi(datandTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | staclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ tic ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 30 warnings generated when compiling for gfx90a. 30 warnings generated when compiling for gfx942. 30 warnings generated when compiling for gfx906. 30 warnings generated when compiling for gfx1030. 3030 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx908. 30 warnings generated when compiling for gfx1102. 30 warnings generated when compiling for gfx1100. 30 warnings generated when compiling for gfx1200. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc 30 warnings generated when compiling for gfx1101. 30 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ _LL_LIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccITS_:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces]U NDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: 107 | .lnote: lProtoRexpanded from macro 'RCCL_LL_LIMITS_UNDEFINED'anges = {RCC 35L_LL_LIMITS_UNDEFIN | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: ^~~~~~~~~~~~~~~~~~~~~~~~expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #de fine RCC L_LL_L| IMITS_UN { }DEFINE= {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ D 0 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFI | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #defiNED 0 | ^ ne RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc0:139: 21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | | .ll ^Proto Range/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccs = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34::171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .l lProtoRangesnote: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llPr= {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ otoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRangeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ s = {RCCL_LL_LIMI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccTS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35::171:3421: warning: :suggest braces around initialization of subobject [-Wmissing-braces] 171 | note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #def .llProtoRanges = {RCCL_LL_LiIne RMCCL_ILL_LTIMITSS_UND_EFUNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ INED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' | { }35 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h: | 35:34: #note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' d 35 | #edefifne iRCCL_nLL_LeIMIT S_UNRDEFICNED 0C | ^L /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc_:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] L 171 | L .l_lProLtoRanIges M= {RCCL_LL_ITS_UNDEFINEDLIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35: 0 34| ^ : note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMIT/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccS_UNDEFINED 0 | ^ :235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc :203:R21: warning: Csuggest braces around initialization of subobject [-Wmissing-braces] 203C | .LllPro_toRaLngesL = {_RCCLL_LL_ILIMITMS_UNIDETS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] FI NED},235 | .llProtoR | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges =anges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #d e{RCCLf_LL_iLIMInTS_UeNDEF INRED},C | ^~~~~~~~~~~~~~~~~~~~~~~~C L_LL_LIMITS_UNDEFINE | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35D:34 : note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED'0 35 | #d efin| e ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccRCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' :267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 2150 | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc | static ncclResult_t ncclTopoIdToInd:267:3:ex (stwarning: rucnested designators are a C99 extension [-Wc99-designator]t 267 | .llProtoRanges/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ [RCCL_RS_TUNAnccBlTLopoESys]tem * ; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ =sys tem/, *intR typee, dintu64_ct ied, Sintcatter*/ {* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h/* :226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToILL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ nde/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccx(str:uct267 nc:c17lT:opo Syswarning: temarray designators are a C99 extension [-Wc99-designator]* s ys tem, i267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* nt rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ 237 | static ncclResult_t ncclTopLoL (MDin/eMaxv) *ToRank(struct ncclTopoSystem* s/y {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ stem, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ :171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc | :139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ .llPro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ toRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t nccl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ TopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static nccl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ Result_t ncclTopoIdToNetDev(struct ncclTopoSys/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ tem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ lTopoIdT/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.ccoIndex(struct ncclTopoSystem* system, int type, int/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ 64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ 237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSy/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:107:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 107 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ stem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:139:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 139 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ 283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ static float treIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ eCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:171:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 171 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:203:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 203 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:235:21: warning: suggest braces around initialization of subobject [-Wmissing-braces] 235 | .llProtoRanges = {RCCL_LL_LIMITS_UNDEFINED}, | ^~~~~~~~~~~~~~~~~~~~~~~~ | { } /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_common.h:35:34: note: expanded from macro 'RCCL_LL_LIMITS_UNDEFINED' 35 | #define RCCL_LL_LIMITS_UNDEFINED 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:3: warning: nested designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:267:17: warning: array designators are a C99 extension [-Wc99-designator] 267 | .llProtoRanges[RCCL_RS_TUNABLE] = /*ReduceScatter*/ {/* LL (Min/Max) */ {0, 655360, 1} , /* LL128 (Min/Max) */ {131072, 3211264, 1}}, | ^~~~~~~~~~~~~~~~~ float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:354:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 354 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:355:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 355 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:356:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 356 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:357:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 357 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:360:9: warning: unused variable 'ppn' [-Wunused-variable] 360 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ 27 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static nc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ clResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/tuning.cc:648:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 648 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx942. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx1201. 27 warnings generated when compiling for gfx1200. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int*In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, st value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, cruct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ onst char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ :233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/archinfo.cc /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ 6 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ 6 warnings generated when compiling for gfx1200. 6 warnings generated when compiling for gfx1102. 6 warnings generated when compiling for gfx90a. 6 warnings generated when compiling for gfx1100. 6 warnings generated when compiling for gfx1201. 6 warnings generated when compiling for gfx942. 6 warnings generated when compiling for gfx1101. 6 warnings generated when compiling for gfx1030. 6 warnings generated when compiling for gfx906. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc 6 warnings generated when compiling for host. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx942. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 11 warning generated when compiling for gfx90a. warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx942. 11 warning generated when compiling for gfx1102. warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 11 warning generated when compiling for gfx1101. warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx908. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/param.cc 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for host. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from 2 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx908. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocmwrap.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx942. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/signals.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/strongstream.cc /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ CKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: In file included from unused function 'log2i' [-Wunused-function] 44 | s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:t13: warning: unused function 'log2i' [-Wunused-function] a 44 | statict long log2ii(In file included from c long lolong gn) { | ^~~~~2i(lo ng n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1200. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx1201. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx942. 15 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx1200. 4 warnings generated when compiling for gfx1201. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx942. 4 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc 15 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:143 warnings generated when compiling for gfx1201. : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3In file included from warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 11 warning generated when compiling for gfx1200. warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.hIn file included from In file included from :15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, vIn file included from oid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc*: d9a: tIn file included from a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h,: 13i: nIn file included from t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h :s15i: zIn file included from e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h,: 38v: oIn file included from i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.hd:*14 : mh/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.ha:n44d:l13e:, vwarning: ounused function 'log2i' [-Wunused-function]i d** request) { N44C | CsLtCaHtEiCcK (lcoonmgm -l>ongc2cil(CloolnlgN ent)- >{i f l| u ^~~~~s h(collCommIn file included from ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc :d10a: t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.ha:,17 :s21i:z ewarning: ,unused function 'collNetDevices' [-Wunused-function] mhandle, reques t17) | )s;t arteitcu rnnc cnlcRcelsSuulctc_ets sc;o l}l N e| t ^~~~~~~~~~~~~D evi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.hc:e29s:(21s:t rwarning: uunused function 'collNetTest' [-Wunused-function]c t ncclComm* co m29m | ,s tiantti*c nndcecvl)R e{sul NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ t_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx1100. 22 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx1030. 22 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint3In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h2_t y, head, :15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResIn file included from u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_lt_t ncclTt ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struopoIdToIndex(struct ncclTocpt ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int oSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncval, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ clTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: {unused function 'mirrorBits' [-Wunused-function] | ^~~~~~~~~~~~~~~~~ 286 | static int mi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hr:r272o:r14B:i twarning: sunused function 'ncclTopoNVLinkBw' [-Wunused-function]( int val ,272 | isntta tpico wf2l)oa t{ n c| c ^~~~~~~~~~l TopoNVLinkBw/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc(:i285n:t21 :c uwarning: dunused function 'netDumpMap' [-Wunused-function]a CompCap) 285{ | s t| a ^~~~~~~~~~~~~~~~t ic n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.hc:c283l:R13e: swarning: uunused function 'isPow2' [-Wunused-function]l t_t n e283t | DsutmaptMiacp (bsotorlu cits Pcoown2n(eicnttM avpa*l )m a{p ) | { ^~~~~~ | ^~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h :286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 16 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx1200. 16 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx1101. 16 warnings generated when compiling for gfx908. 16 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx942. 16 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ har* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 24 warnings generated when compiling for gfx90a. 24 warnings generated when compiling for gfx1102. 24 warnings generated when compiling for gfx906. 24 warnings generated when compiling for gfx1030. 24 warnings generated when compiling for gfx1201. 24 warnings generated when compiling for gfx1101. 24 warnings generated when compiling for gfx942. 24 warnings generated when compiling for gfx1100. 24 warnings generated when compiling for gfx908. 24 warnings generated when compiling for gfx1200. 16 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc 24 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from :8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static longIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(lonIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.ccg n) { | ^~~~~ :10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx908. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, intIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(st:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ ruct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:215:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 215 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:226:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 226 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:237:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 237 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:249:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 249 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:262:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 262 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:272:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 272 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:283:13: warning: unused function 'isPow2' [-Wunused-function] 283 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/graph/topo.h:286:12: warning: unused function 'mirrorBits' [-Wunused-function] 286 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 10/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 10 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for host. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: In file included from note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t y, he/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: aIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:d15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:,77:18: warning: unused variable 'y' [-Wunused-variable] 77 | m uinta3nti2_t y, ssa; | ^ head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14In file included from : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadint w I= threadIddx.x/WARP_SIxZE; \ | ^ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: | 11 ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145: :21: warning: unused variable 'flag1' [-Wunused-variable] In file included from 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h uint3:2 _ | ^~~~~ 174/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.ht:145:28: : warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h145 | d uinat32_t d:tata1, aflag1, 1data2,, flag2 ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hf:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from dIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2I: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11Z: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: ;warning: unused variable 'w' [-Wunused-variable] 80 | \ barrie In file included from | ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threIn file included from adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.chaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->cha/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hnnelLo; | ^~~: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ 366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hd:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ n nelId= - wor k->channnelLo;c | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId -/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11w: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.ho:75:7: warning: unused variable 'w' [-Wunused-variable]r 75 | k ba-rrier_>by_grocup(); h| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:a29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from nnelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3:366:215: warning: unused variable 'bid' [-Wunused-variable] _ 366 | t co nst indt bid a= ncclSthmem.cahannel1Id -, work- >chanfnelLol; | a ^~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ onst int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:171:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 171 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RuIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllGather_RING_LL128_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buff/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hS:218:15: warning: unused variable 'bid' [-Wunused-variable] izes[NCCL_P 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRingchannelLo; | ^~~ _UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from IMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp::2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:67011: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h::174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:760: warning: unused variable 'w' [-Wunused-variable] :75 | barrinote: er_by_field 'group' will be initialized after field 'stepSize'group (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 670 29 | | cons t int w = thr eadIdx tid(tid), nthreads(nthreads), tidInBlock(th.x/WARrP_SIZE;e \ a| ^ dIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int b/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ id = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, ntIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ hreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | NG_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] t366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFunIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMcAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlockP(LE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:171:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 171 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllGather_RING_LL128_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58reads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDev:F56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.head((t:id%4)==3)173, group(g: roup), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499: | st670epSize(:nc15: warning: initializer order does not match the declaration order [-Wreorder-ctor] clS hmem.comm.buffSize670 | tid(tid), nthreadss[NCCL(_PROTO_nLL128]/tNCCL_SThEPreaSd/sizs), tidIeof(uninBlock(threadIdxt6.4_t)) {x), | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.houp:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthr(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | eads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL12ff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 8]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] fSizes[NCCL670 | _PROTOt_Sid(tIMPLE]/NCCL_STEPS/sizeoidf(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | PrimIn file included from ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0itives ? ncclShmem.comm.buffSizes[NC, /*ODirect=*_/0, ProSto, 0>I prims M | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hP:565:LE]/NCCL_STEPS/sizeof(T) : stepSize_) 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | PrimitivesA(tid,R nthIreadsT, worYk); >| ^ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :432:78: /note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here *432 | D irect/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ =*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565if :(tid <5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 subtn) | Run Work Coll p().rDun(own, 0, 2, 2>::run' requested here 7 | DEFINE_nc RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); c| lD ^evFu nc(A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hllReduce_:TREE_432SIMP:LE_78MinM:ax_ note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here bf16_2432 | , inccflFu (tid < subtn) RunWorkCollROTO(_SI)MPL.rEu, n(tid, s2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncrcedlopD, valgFo, protou,nc(AllReduce u_nroTll>R().ErunE();_ \ S | I ^ M/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hP:670:LE_MinMax_bf16_2, ncclFuncAllReduce, Fun15c: note: Mfield 'nthreads' will be initialized after field 'tidInBlock' i670 | n M tiad(tid), nxthr,eads (nthhreiadsp), _tidbInBlfockl(thorat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hu:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | seads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDow note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ X_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Pronote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTONROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | _SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work, g)roup;(gro up), | | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 ^: 60: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hfield 'group' will be initialized after field 'stepSize' 670 | tid(tid), :nthr432eads:(nth78read:s), t idInote: nBloin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereck(t hrea dIdx.x),432 group(group), | | ^~~~~~~~~~~ if (ti | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch<, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupcoll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn,id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : st work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINEp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS(tiid), nthzreads(ntehreads)s, tidIn[Block(tNhreadIdCx.x), gCroup(grLoup), | _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_P 671 | R stepSOize(stepTSize_ =O=_S I0 MPLE]/NCCL_ST? ncclEShmem.cPSomm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h s:670:u15: warning: initializer order does not match the declaration order [-Wreorder-ctor] b670 | t tid(tnid), n,threads (nthrwork); | ^eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp.:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DcEFINE_nocclDevmFunc(AlmlReduc.e_RINGb_LL128u_MinMafx_bf16_f2, nScclFunicAllRedzuce, FeuncMisnMax, hip_bf[loNCCat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) L_PROTO_SIMPLE]/NCCL_STE PS/s| izeof(^T) : s tepSiz/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611e_) :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, algo, proto, unroll>().run(); \ | ^ p, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Mi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnMax_bf16_2, n:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitiv| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tes, s/*Di(rnthreads)ect,=*/0 , Prtoto,i 0>dInBlock(threadIdx.x), group(group prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTr), e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ eUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hp:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | e_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, n>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channel/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Lo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadId/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = nccx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] lIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ Shmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hzes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hwarning: :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPinitializer order does not match the declaration order [-Wreorder-ctor]LE _MinMax_bf8_2, ncclFunc A670l | l R e du ctei,d (Ftuindc)M,i nnMtahxr,e ardcsc(ln_tbhfrleoaadts8),, NtCiCdLI_nABLlGoOc_kT(RtEhEr,e aNdCICdLx_.PxR)O,T Ogr_SoIupM(PgLrEo,u p2)), | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 671 | s t611e | p S i z eR(usntWeoprSkiBzaet_c h=<=c o0l l?, ntcyc,l Srhemdeomp.m,. baulfgfoS,i zperso[tNoC,C Lu_nPrRoOlTlO>_(S)I.MrPuLnE(])/;N C\C L _| S ^T EPS/siz/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.he:o670f:(15T:) note: :field 'nthreads' will be initialized after field 'tidInBlock' stepSize _670) | { | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(groupi d), nthreads(nthreads)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h,: 254t:i90d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herel ock(threadI d254 | x . x ) , g rPoruipmi(tgirvoeusp<)T,, R| e ^~~~~~~~~~~~~~~~~d Op,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :F670a:n60A:s ynote: mfield 'group' will be initialized after field 'stepSize'm etricr,e a/d*sD(inrtehcrte=a*d/s0),, PtriodtIon,B l0o>c kp(rtihmrse a d| I ^d x.x), g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hr:o565u:p5(: gnote: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hereo up), | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreadsCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)=/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h=3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h : stepSize_) {/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives670:15: ,warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t0id(tid,), :n warning: initializer order does not match the declaration order [-Wreorder-ctor] tP 670 | rh tidor(tid)te, nthraeads(dnths(nthreads), tidInBlock(threadIdx.o, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing671( | s st(epSizet(stepSiitze_ =de= p0, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hSi:ze_ == 4320 ? :ncclShmem78.comm.:buffSi ze s?note: nccl[in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereShmem .coNmm.bCuffSizC es[NLCCL_PR432_OT | O _SP RI OMPT O _LiSEIf] (tid < subtn) RunWMPLE]/NCCL_STEPS/sizeof(T) : ste/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:orkColl()pSi.ze_)r { u| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n| group(group( ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:d90: note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303s | u Pbrimittiv303n:e90: ,snote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here < wT303o | , r k RPri)em;diti Ove sp<| T,, ^Re d OFp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cppa, FnaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().rnAAsymme:sy12mt:m1er:ti rnote: iin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herec,A /*lDirlec NRt=CCL_MA*X/_0DEe,Vd_ARIT Y>, /uP*Dircreeoc_ttRo=*,/I 00N,G_SIMPLE_> Mprimsi | ^n /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hMa Pr:oto565, 0:x> p5rim:s_ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hnote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | un(); \ | ^ rbf8_unTr2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL _ rAunTrLeeUpDGownC, eCLeUpO_DowLPn(tid, nthreads, wor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hk:), R611;edO :p, 62 | Pr:o ^to Sim plnote: e<1expanded from macro 'DEFINE_ncclDevFunc', 1, COLL_UN611RO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | LL >, COLL _UN: ROL432RL>(:tid, n78thu:readn s,Wnote: wooin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hererk)r ; k | ^B a432 | t c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_h, aalgox, pr_oto, bunrofll>().run();8 \ _| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h4:,670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' n 670 | c ctid(tlid),F nthureadns(ncthrAealds),l tidRInBloecdk(thureadcIdx.ex), group(gr,oup ) F ,ifu (tn id c| ().run(tid, subtinnMax,, rccl_bflo at8, wNCCLo_ALGrO_TRkEE), NC;CL_P ROTO _SIM| PLE, ^4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: 611670 | : 62 ti:d(t id),note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cppexpanded from macro 'DEFINE_ncclDevFunc' nth :read 17s(611nt | : hr1 ead:s ) ,note: tRiin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested heredIun BlnoWck(othre rkadIdBx.x)a, gr17oup( | tgroDucp)E,hFINE_nc, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hlReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL:670:15_: note: field 'nthreads' will be initialized after field 'tidInBlock'P R670O | T tidO(tid)_, ntShreaIds(nMthrePadLs), tEidIn,Bloc k(th4read)Idx. x), gro| up(g^r oup)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h,: 611 | : ^~~~~~~~~~~~~~~~~ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlo, grcoup(gkroup(), t| ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, nc17clFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:oto, COLL_UNROLL>().run(tid, subtn, wo note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RIN/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ G_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0>:670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p 670 | r tid(tid), nthreads(nthreads), tiims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); dIn Block| (threadIdx.x), ^gro up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h== 0 ?: nccl432:Sh78m:e note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWm.ocomm.rbuffSkizes[CNCCL_oPROTO_lSIMPLlE]/N().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DizEe_) {F | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ I | group(group N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:E56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here _ 63 | n Prcimitivesc, u0n,c (PAllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReducroto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid:, nthreads611, wo:rk);62 | ^: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:note: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here expanded from macro 'DEFINE_ncclDevFunc' 432 | if (ti611d < | su btn) RunW orkC oll, algo, proto, Al,go, uPronto, COLL_UNROLL>/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ roll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cppr:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereo u 22 | DEFINE_pnccl)DevF,unc( All Red| uce_ ^~~~~~~~~~~~~~~~~RING _SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hPLE_MinMax:_bf8670_4, :ncclFuncAll60Reduc: e, Funote: ncMifield 'group' will be initialized after field 'stepSize'nMax, rc cl_bfloat6708, | NCC L_A LGO_ RING , tNiCdC(L_tiPdR)OT,O _nStIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:ncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_ note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_unWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(t 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx908. 1717 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.c->channelLho; | ^~~ annelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dataIn file included from 1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hf:75:7: lwarning: unused variable 'w' [-Wunused-variable] a75 | g barrie1r_by_gr,oup(); data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 29 | 75 | barrier_by_grouconst p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = thIn file included from readIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ em.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t da/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hta1, flag1, :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId -data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35 work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; In file included from | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = nIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ cclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - woIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rk->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const inIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId -In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_ runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ PROTO_SI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FunMPLE, 2) | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nth/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' cMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread611 | RunWorkBatchI, algo,d proto,x unroll>.().runx()reads(nthreads), tidInBl;ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \), group(g | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz, group(egroup_), | ) ^~~~~~~~~~~~~~~~~ {/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 :60: note: field 'group' will be initialized after field 'stepSize' 670 | | tid(t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id), nt hreads(nthread| s), t group(groupidInB lock(threadIdx./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hx):63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | , group(group), | ^~~~~~~~~~~ Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) In file included from | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWork/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Batch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | In file included from runTre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntheSplita(tidds, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMaizex_) {, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ h| group(groupa /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: lnote: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here f254 | , Primitives_, /*PDRiOrTeO_LctL=*/012,8, P2roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:565611:62 | : note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWo rkBratuch, algo, proto, unroll>().run(); \ | ^ nTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrea d| group(group s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303):90: ,note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives(, /*Dtirect=h*/0, rProteo, 0>a primds Id| ^x ./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hx), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_eUNROL_L>)(ti d, nt{hread s, wo rk); | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: 432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here | 432 | group(group if (tid < su/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hbt:n254): R90:un Wnote: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | PrimitivesorkColl, /*Direct=*/0, Proto, 0> prims | ^ Proto, C/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hOLL_UNROLL>().r:un(ti565d, sub:tn, w5ork);: | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:note: 1: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested herenote: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DE FINE_ncclDevFunc565(Al | lR educe _ TR rEEunT_SrIeMeUPpDLoE_wMni, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, COLL_UNROLL>(L_Mtid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloIn file included from ck(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173r: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:e warning: initializer order does not match the declaration order [-Wreorder-ctor] a d670 | tiId(tid), nthredads(nthxreads.), tidIxnBl)ock(th,rea dgroup(grouIdpx.)x), g,roup( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st | e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ p 671 | stepSizeS(stepSiize_ z==e(s 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, /o*to,D COLL_UNROLL>(itid, nthreadrecst=*/0,, P rowto, 0o> prims | ^r k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | :432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | runTreeUpDown, OCOLpL_U,NR OLALl>g(oti,d P,r notthor,ea CdOsL, Lw_oUrNk)RO;L L>(| ^ )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h.run(:t432:i78d: ,note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested heres u bt432n | , w o rifk )(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h;t i | d < ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here su/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFbtIn) RNunWoErkCol_l, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReducen, T,c RedcOp, lAlgo, ProtDo,e vFunc(AllReCOLL_UNROLL>().run(tid, subtn, work);duce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_A L | ^ GO_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cppR:I7:1N: note: Gin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here ,7 | DENFICNE_ncclDCevLFunc_(AlPlRRedOuce_TO_TRSIMPLEEE,_SI MPL2E)_ M | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: innote: Max_expanded from macro 'DEFINE_ncclDevFunc'f16_ 2, ncc_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lFunc611 | Al RunWorkBatchlReduce,< FucncMoinMlax, lhal,f, NCCL_ALGO_TREE, NCCL_PROTO_SI ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hM:P670LE,: 215) : | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hnote: field 'nthreads' will be initialized after field 'tidInBlock':611 :62 : note: 670 | tid(tid), nthreadsexpanded from macro 'DEFINE_ncclDevFunc'( n 611 | t h RurnWeorkBaatdchs,n Balglo,ock (ptrhorteoa,d Idxunro.llx)>, ()g.rrounup()(;g r\o up), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:| ^ note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadfield 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threas(dnthIreaddsx), .tidxInB)loc, k(tghreraoupdIdx.x(), ggrourp(goroupu), p), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78In file included from : note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)_Mi nMax_f 16_4, ncclFuncAll497Reduce, | Fun cMinMax , ha lf, NwCarpInBlock(threadIdx.CL_ALGO_TREE, NCCL_PROTOx_SI/MWPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, pARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here roto, unroll>().run()10; \ | ^ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hD:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'E F670 | tiId(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), NE| _ncclDe ^~~~~~~~~~~~~~~~~vF unc(A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hllReduce_RING:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(_LtL128_MiinMax_fd16_2, n)c,c lnthreads(nthreadsFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ kBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> pr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:ims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558: 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested herewarning: 558 | initializer order does not match the declaration order [-Wreorder-ctor] runR ing< T, RedOp, P670roto, CO | LL_UN ROLL> ( ti dt,i dn(ttid), nthreads(nthreads), htreadsi, workd); I| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:n432:78B: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here l432 | o ckif(t (htreadIdx.x), groupid < subtn() RunWorkCollS().runT(tid, Esubtn,P Swork);/ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpps:12:i1:z note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here e 12 | DoEFIfNE_nc(clDeT)vF u:n cstepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group (AllReduce_RING_SIMPLE_MinMax_f16/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | _2, n cclFun cAllRe duce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWo 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCC.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (L_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFIrun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LNE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMa/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().rx_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tiun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, w:670:15: warning: oinitializer order does not match the declaration order [-Wreorder-ctor] r670 | ktid(tid)), nth;reads( nthrea ds), t| idI ^nBlock( thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hIdx.x), group(grou:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize ifs (tid[ < suNbtn)C RunWorkCColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDMPeLE]/NvCCL_STEFPS/sizueof(nc(AllReducT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432NROL:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ =*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> primdIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ s | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2e: r_bIn file included from y/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group': In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group (29 | cons)t int; w = th readI| dx.x ^~~~~~~~~~~~~~~~~~/ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hW:A29R:P15:_ Snote: Iexpanded from macro 'barrier_by_group' 29 | const int w = thZrE; \ e| ^ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hg2;: 11| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: :145In file included from :21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hwarning: unused variable 'flag1' [-Wunused-variable] :145173 | : u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hin:t3275_t d:ata1,7 f:l agwarning: unused variable 'w' [-Wunused-variable]1, 75In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | barrierdata_2, fblayg2; _ | g ^~~~~ r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.houp(:145:)28:; warning: unused variable 'data2' [-Wunused-variable] | 145 | uin ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from P_SIZE; \ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp: ^2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:21: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppunused variable 'flag1' [-Wunused-variable]:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :11 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174145: | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145 :14: warning: unused variable 'data1' [-Wunused-variable] u int3145 | 2 _t duaita1nt3,2 flag1, data2, _tf datal1,a fglag12, da;t a2 | , ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hfla:145:g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag128: ,warning: unused variable 'data2' [-Wunused-variable] 145 | d uiant32t_t daata1,2 flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_tuint32_t data1 dat,a1, ffllag1, data2,ag f1l,a dgat2a;2 , f lag2;| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid =:145: 35: nwarning: unused variable 'flag2' [-Wunused-variable] c 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ clShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:In file included from 2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: :In file included from 175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h80::1755:: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h warning: unused variable 'w' [-Wunused-variable]:80:5: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_ g80 | r o baurrip(er_b)y_gr;oup( ); | ^~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :29/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:15: :note: 29:15:expanded from macro 'barrier_by_group' 29note: expanded from macro 'barrier_by_group' 29 | | con st c iont nstw i= nt w = ththrreadeIaddx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | c oconnsstt iinnt bid = ncclShmem.channetl bid I= ncdcl Shm-em.c hannwelIdo - wrork->kc-ha>nchnaenlnLeol;L o;| ^~~ | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | :218: 15: warning: unused variable 'bid' [-Wunused-variable] const in 218 | t co nbstid =i nntc cbilSdh m=em .ncchclShmemannelId - work->channelLo; | ^~~ .channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->chann/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.he:366:15: warning: lunused variable 'bid' [-Wunused-variable] L 366 | o ;const i nt b| id ^~~= ncclShme m.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218onst int w = threadIdx.x/WARP_SIZE; \ | ^ :15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->cha/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nnelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 670 tid(tid), nthreads(nthreads), tidInBlock(th | rtid(tid), nthreeads(ntahreadds), tIidInBdlock(txhre.adIdxx.x), group(group), | ^~~~~~~~~~~), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? n cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | In file included from ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp group(group:2 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre:254a:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here d 254I | d x.x ) Prim, girtoiuvpe(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s, /*Direct=*/0, Proto, 0i>ze( sptepSrize_ =i= 0 ms? ncc lShm| em. ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hcomm:.bu565f:fSi5zes[N:CC note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here L_565P | ROTO_ runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEtoSimple<1, 1FINE_ncclDevFu, COnLL_UcNRO(LL>, ACOLLl_UNROlLL>(tRide,d uncte_hrTeRaEdEs_SIM, wPoLE_rkM);i nMax_f32_ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(2,t nciclFundcAllR,educe , FunscMiunMax,b floatt, NCCnL_AL,GO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hads), tidInBlock(thread:670:15:Idx .x), warning: ginitializer order does not match the declaration order [-Wreorder-ctor] ro u670 | p(group t)i,d( tid)| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 0 ? ncclSshmemt.comme.buffSpizeSs[NCCiL_PROzTO_SIeMPLE](/NCCLs_STEPtS/sizeeofp(STi)ze_ == 0 : s tepSi?ze_) nc c{l Shm | em ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .comm.buffSiz e| group(group s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h[:303N:90:C note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here C L_PROTO_SIMPLE]/NC303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primiti_DEVv_AReITs, /*DiTrect=*/,0 , RPerotdo, 0> Op, FanAsymmetric<1, NCCpriLms _ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hM:565A:5: note: Xin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here _565 | DE VrunTr_eeUpARDITY>o,w /n* p1r,i m1s, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2[ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(nthreads), tidInBl:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_SITdx.x)E, groPup(groSup), /| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | i z steeopSizfe(ste(pSize_T )== 0 ? nc:clSh smemt.epSize_co)mm.bu ffS{i z | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupes[NCCL /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | 56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(t=*/h0, Prrotoe, 0> aprimds | I ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:d565:5:x note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here . 565x | r)unTre,eUp Downg | , stepSize(stepSCiOLL_UNROLLz>(tied, nt_hrea ds=,= w0o ? rk)ncclShmem.comm.buf; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hf:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | S i zes[ N CC iL_fPR O(TtiO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives(y).runm(tid,m subten, wtork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppr:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFIiNE_nccclDevF, /*DliReducre_TREeE_SIMcPLE_MtinMax=_f3*/2_02, Proto,, ncclFun 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown: note: ,expanded from macro 'DEFINE_ncclDevFunc' 611 | RCunOWoLrL_UNROLL>(tid, nthreads, work); kBatc| h, algo, proto, unrol:l>432:(78):. note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | irun(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hf (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:unTreeUpDown<670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] T, RedO p, Pr670 | tid(tid), nthreadsotoSi(nthreads), tidInBlmple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work)ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | tid(tid) , if (tidn < thsIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:reads(nthreads), ubtn) tRuniWordkColl() .run| (ti ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d, subt | n, tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s wtork)e; p| ^S /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppi:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here ze17 | _D E=F= IN0 E_?n cncclcDleShvmFeumn.cco(mmA.lbluRffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670611 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLEof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_MAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primiti 558 | ves, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFIN runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) Ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]nWork 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Coll().run(E_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRe/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, Func 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARPIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | _SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: Z/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]E 75 | ; barri er_by\ | ^_group( ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from , dat/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: aIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:2174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:, flag2warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:warning: 145:21: warning: unused variable 'flag1' [-Wunused-variable]unused variable 'flag1' [-Wunused-variable] 145 | 145uin | uint32_t data1, flag1, data2, flag2; t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h: 80:5: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ncclShmem.channelId /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2-: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27: 15: warning: unused variable 'bid' [-Wunused-variable] w27 | o consrt int bkid =- ncc>lShmemc.hannelLochann;elId | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366 - wo:rk->ch15annelL:o; | ^~~ warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = nIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ cclShmem/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | con.chansnelId t- work ->chaninelLno; | ^~~t bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxsendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] w670 | a tird(tpid), nIthreands(ntBhreadls), toidck(threadIdx.x/WARP_SIZInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.E), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclSIn file included from hmem.comm.buffSizes[NCCL_PROTO_LL128buf]fSize/s[NNCCL_PCROTO_CSIMPLLE]/NC_CL_STSEPS/sTizeof(ET) : PstepSSize_) /{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s | group(group i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hzeof(uint64_t)) {:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ =*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: CIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from O/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.hL:498:29:L warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] _ U496 | N tid(Rtid),O nthreLads(nLthrea>ds), (wid(ttid%WiARP_SdIZE), ,warp (tid/nWARP_tSIhreads, work);In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here ZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIZE), 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_n c | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ c | warp(tid/WARP_SIZEl D498 | e v flFagTuhrenad(c(tid(%4)==3), Agrolup(lgroRup)e, d| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ u| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 c499 | e _ sTtepRSizEe(nEccl_ShmSem.IcomMm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uintPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) 64_t )) | { ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421::9: 611note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here: 62421 | : note: prexpanded from macro 'DEFINE_ncclDevFunc'ims(tid, nthreads, tree->down, tree->down, wor k ->se611 | RunWorkBatchrecvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: op, algo, proto, unroll>().run(); \ | ^ in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h1070 | : r670unT:ree15Spl:it< T, note: Redfield 'nthreads' will be initialized after field 'tidInBlock'Op 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(,g PrrotooLL12u8, pCOL)L_U,NR OLL >(t| id, ^~~~~~~~~~~~~~~~~ nt hre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hads, work);: | 670 ^ :60/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h::432: 78: note: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here field 'group' will be initialized after field 'stepSize' 432 | if670 (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreaIn file included from ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_S | I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Z | group(groupE ),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :254| :90 ~~~~~~~~~~~~~~~~~~: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)254 | 497 P | r im it iv esw,A /R*DPir_ecSt=I*/Z0,E P)ro,to , 0> p| rim ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s | ^| warp(tid/WARP_SIZE /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTre eU498 | p D flagThrowen8, ]CO/NLLC_UCNRLOL_L>S(tTidE, PntShr/easdsi, zweorok)f; ( u| i ^ nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h6:4432_:t78:) note: )in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~if ( ti| d group(group< su/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hbtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | P ro to prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); ,| ^CO LL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h_UNROLL:>(1070).:ru5n(:ti d,note: sin instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested hereub t n, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_AL1070 | runTreeSplit(tid, nthreaGdO_sTR,EE, NwCCoL_rPRkOT)O_;SI MP LE| , ^ 2)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h::611432:62: note: expanded from macro 'DEFINE_ncclDevFunc': 78611: | note: Rin instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested hereun Wo rkBa432tc | h< co ll , t y, riedfop t, ialdgo, p().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMsa(nxth_refad6s)4, _ti2dI,nB lonckc(tchrleaFdIudxn.xc),A glrolup(Rgreodupu),c e| ^~~~~~~~~~~~~~~~~, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :F670:u60:n note: cfield 'group' will be initialized after field 'stepSize' M670i | n M taidx(,ti d)d, ontuhrbealdse(n,th reNaCdsC),L t_idAInLBlGocOk(_thTreRadEIdEx.,x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stnthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), epSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' double, NCCL_ALGO_ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), d s(nt| hrea ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ds), ti dInB| lock tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_(thr ea dIdx.x), g671roup | (group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nth, group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hC:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, wor if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here work) ;17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> pr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hi:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hrun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSh/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreaROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx908. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 21 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 21 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:112: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h::211: : In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:174:: 11/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: In file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h145::14173:: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hunused variable 'data1' [-Wunused-variable]: 75:7: warning: unused variable 'w' [-Wunused-variable] 145 | uint32_t d75a | t a1 , f l abga1r,r ideart_ab2y,_ gfrloaugp2(;) ; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:: 29warning: :unused variable 'flag1' [-Wunused-variable]15 : note: expanded from macro 'barrier_by_group' 145 | 29u | i n t3 2 _cto ndsatt ai1n,t fwl a=g 1t,h rdeataad2I,d xf.lxa/gW2A;R P _| S ^~~~~I ZE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h;: 145\: 28 :| ^warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from 27 | const int bi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] d = ncc218lSh | me m. cha nne lIdc - owornk->schtann elLoi; n| ^~~t bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from :218:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h15: :warning: unused variable 'bid' [-Wunused-variable] 11 218 | : In file included from co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hnst int bid = ncclShmem.channelId - work->channelLo; | ^~~ :175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrie/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclr_Sby_hgmrouep()m; .| ^~~~~~~~~~~~~~~~~~ c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:h29:a15: nnote: expanded from macro 'barrier_by_group'n 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ elId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ mem.channelId - work->channeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from :2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, P:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().ruIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), :670:15n: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t tid(thid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hIn file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] :75 | 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threhananelLo;d | ^~~ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const in_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ork->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hzes[NCCL_P:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*== 0 /? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatcIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pr:o670:15: twarning: initializer order does not match the declaration order [-Wreorder-ctor] o670 | ti, unroll>().run();d(tid), nthreads(nthreads), tidInB \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), lo| ck(t ^~~~~~~~~~~~~~~~~hread Idx.x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h), group(:group)670, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_60 671 | : stepSnote: ize(field 'group' will be initialized after field 'stepSize'stepS ize_ == 0 ? ncc670lShme | m.com m.buff Sizes [NCCL _Ptid(tid), nthreads(nthreads), tidInBlocROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupk(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h::670611::1562:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor k670B | at c h < ctoildl(,t itdy),, rendtohprs,( natlhgroe,a dpsr)o,t ot, iudnIrnoBlllo>c(k).(rtuhnr(e)a;d I\d x .| x ^) , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tid:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ncclShmem .254c | o mm . b u f fPSriizmeist[iNvCeCsL<_TP,R ORTeOd_OSpI,M PFLanEA]s/yNmCmCLe_tSrTiEcP_,) /{* D i| re ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c t =| * group(group/ 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h : 63r:u56n:T rnote: eein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereU pDown, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Simple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrol, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herel>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrolLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_17 warnings generated when compiling for host. 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ead, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7In file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.headIdx.x/WARP_SI:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:Z174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7E: warning: unused variable 'w' [-Wunused-variable] ; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->chIn file included from annelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId -In file included from work/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp-:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:>11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] c75 | h barriaer_by_ngrounelLo;p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_SIZE; \ | ^~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:15: warning: unused variable 'bid' [-Wunused-variable] 366 | : cons145t int: bid =21 ncclS:hmem.ch annelIwarning: d - wounused variable 'flag1' [-Wunused-variable]rk->ch annelL o; | ^~~ 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11 da: ta1, fIn file included from lag1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hdata2,: fla174g2; : | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:: warning: unused variable 'data2' [-Wunused-variable]75:7: 145 | uwarning: int32_unused variable 'w' [-Wunused-variable]t data 1 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | , f lag1, cdata2,o flag2n; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hs:145:35:t warning: unused variable 'flag2' [-Wunused-variable] 145 | i uinnt32_tt data1, w = thre flag1, data2, flag2; | ^~~~~ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channel 29 | Id - work->channelLo; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.honst int bid = :n218:15:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ cwarning: unused variable 'bid' [-Wunused-variable] c218 | lShmem.channelId con-st int bid = wnccork->channelLo; | ^~~lS hmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLoIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const ; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId -In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | Pri tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_mitives, /*671 | stepSize(stepSize_ == 0 ? nDicrect=c*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, CAOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | tn ^, work );/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp::7:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 0, 2, 2>::run' requested here 7o | DEFItNE_onccSlDevFiunc(AmllRedupce_TRlEE_SIMePLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | <1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthread tsid(ti,d), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); ck(th readI| dx.x ^), gro up(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | oto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIn file included from IZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670497:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), groupd(s(nthregads),r tidInoBlocku(threadIpdx.x), g)roup(gro,up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSizse(stepSitepSizzee_ == 0( n?c cnlcSchlmSehm.comm.buffSizes[NCCL_PRmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofOTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tr(T)e : stepSeize_) { - | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ >| group(group d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | o Primwitives,r /*Directk->s=*/0,e Pnrdbuff, work->reoto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | cvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here runT432reeUp | Down, COLL_UiNROLfL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11 runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()671 | stepSize(s.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_ == 0 ? ncc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadlShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: Inote: dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmfSizes[NCCL_PROTem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepuffSiSzes[NCCLi_PROTO_SIzMPLE]e/NCCL_S_TEPS/si)zeof(T) : stepSi{ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitiveoup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5:| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ru note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here n T558 | r runReing(tidn, nth, 1, 2, 2>::run' requested here 432O | p if (,tid < subtPn) rRunWorkColl().run(tid, otoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColls, 1, 2, 2>::run' requested here R12 | DEFIeNE_ndcclDOevFunpc(A,llRe duce_ARINGl_SIMgPLE_oMinMa,x_u6 Proto, COLL_UNROLL>().run(tid,4_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :611N:62: Cnote: expanded from macro 'DEFINE_ncclDevFunc'C L611 | Run_WorPkBaRtchO, PalgoL, pErot,o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 ^~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h : 670: 60: note: tfield 'group' will be initialized after field 'stepSize' i670 | d (tidt(tiidd)), ,nth renadts(hntrhreeaads),d tsid(InnBlotckh(trhreeaadIddxs.x)),, gr outp(igrdouIp)n, B l| ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ go, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread:670:15:s warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ) tid(t,id), n threads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here x.x), grotidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE,tidInBlock(threadIdx.x), gr o4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hf:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | , nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz eProto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFI { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hncAllReduce, FuncMi:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInMaxn, uintB64_t,l NCCLo_ALGO_cRk(threadINGI, NCCdL_PROTxO_SIM.x), group(gPLrE, 4)o | u^ p/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611):62: ,note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru| nWor ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~kBat ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 0 ? nc670clS | hmem .com m.b tiduf(fSitzes[iNCCLd)_P, nROTO_SIMPLE]/NCCL_STEPS/stihreazds(nethreaods),f tidI(nBloTck() : stepSthrieadIzdx.xe_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~), gr ou p(| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primgriotup)i, v| ^~~~~~~~~~~~~~~~~e s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:<670:60T: note: ,field 'group' will be initialized after field 'stepSize' 670 | Rtid(etidd), Onpt, FanSymmetric<1>, 0, Proto, 0> prihrmeadss(nt hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tnote: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 21 warnings generated when compiling for gfx942. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cppgroup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29::2: 15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hi:11: nIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:t175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h: 80:5:w warning: unused variable 'w' [-Wunused-variable] =80 | tbarhrierr_bey_adIgdrouxp();. | x ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/:29:WAR15: note: expanded from macro 'barrier_by_group' 29 | const int w = tP_ShIZE;r \ eadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | cons/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cppt int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ IZE; \In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE | ^ ; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | const int w = threadIdx.x/WARP_SIZE;In file included from \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.chIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ annelId - work->channelLo; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channel/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.cLo; | ^~~ hannelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ unused variable 'bid' [-Wunused-variable] 27 | const int biIn file included from d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ZnE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ cclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(ntL_AhLGO_TrREE, eNCCL_aPROTOd_SIMPsLE, 2)) | ^, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :611:62:w note: expanded from macro 'DEFINE_ncclDevFunc' i611 | d RunW(orkBattch, algo, WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>( tid(tid), nthreads(nthre).run(); \ | ^ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(ttihreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, F[ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o uncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp RunWorkBatch, algo, proto, unroll>()/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SI.run(); \ | MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht:i670:15d: note: )field 'nthreads' will be initialized after field 'tidInBlock' ,670 | n titd(thid)r, nethraeadds(ntshre(adsn), ttidhInBrlocek(tahredadIsdx.x)), ,gro up(tgroiup)d, I| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threandBlIockd(thxrea.dIdxx.x)), ,gro up(group(group), group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPtid(tiLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREd), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:e15duc:e, Funwarning: cMininitializer order does not match the declaration order [-Wreorder-ctor]Max , uint8_t, 670NCC | L_A LGO _TR EE, NCtCL_iPROdTO_(SIMtPLEi, 4d) )| ^, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :611:n62: tnote: expanded from macro 'DEFINE_ncclDevFunc' h611 | r e RuanWorkBatchh, ralgeo, parotdo, sunr)oll,>() .rutn(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tididInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO(_tidS), InthrMeadPs(nLthrEead]s),/ tiNdInCBloCck(Lthr_eadISdxT.x)E, gProupS(gr/oups), i | ^~~~~~~~~~~~~~~~~z /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.he:670o:60: fnote: field 'group' will be initialized after field 'stepSize' ( T670 | ) t id(:tid ), snthtreaeds(pnthSreadis),z e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0>/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < sub t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ctomm.bufifSizes[NdCCL_PRO(TO_SIMPtLE]/NCCiL_STEPSd/s), nthreads(ntizeof(T) : stepSihreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hS:63:56:h note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63m | Priemitivesm, 0m, Prmoto, 0>. prims b | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.huffSizes[NCCL_PROTO_SIMPLE]/NCCL_ST:558EPS/:5: note: sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | i zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h runRing:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 1, 2, 2>::run' requested herel 432 | e < i1f (tid, < su btn) R1unWor,kColl< Fn, TC, RedOOp, AlLgo, PLroto, _COLLU_UNRONLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432c(A:llRed78uce_R:ING_S IMPLEnote: _MinMin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested hereax_u8_ 2, n cclFuncAllRe432duce, | FuncM inMax if (tid < subtn) RunWorkColl().run(tid, ,s uiunt8b_t,t NCnCL_,ALG O_RwINGo, NCrCL_kP)RO;TO_ SIM PLE| , 2 ^) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' :611 | 7 :Run1Wor:kBa tchnote: , 0, 2, 2>::run' requested herell, ty , redop7, DalgEo, proto,FINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax ,unr ollu>()i.runn()t; \8 | _ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht,:670: 15: Nnote: field 'nthreads' will be initialized after field 'tidInBlock' C 670C | L t_id(AtidL), GnthrOead_s(nTthRreadEs), Etid,InB locNk(tChreCadIdLx.x_), PgroRup(OgTrOoup_), S IMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ tid(tid), nthreads(nthreads), tidInBlock(th, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:re670adI:dx.15x):, gnote: roufield 'nthreads' will be initialized after field 'tidInBlock'p/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17( g roup | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ )670, | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sub:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize(stepSiz/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrole_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hi:670n:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] g 670 | < tid(tTid), n,threa RedOp, Prods(ntthreados),, COLL_UNROLL>(tid, nthreads, wo rtidInBklock(t)hrea;dIdx.x ), gro up(grou| p), | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (t | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMid < subtn) RunWorkColl().run(tid, subtn, work); | ^P LE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp]/NCCL_STEPS/sizeof(T) : stepSiz:e_)22 { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclD:303:e90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here v 303 | F Punc(AllReduce_RING_SrimitiIves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:ncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here :565 | 611 ru:nTre62eUpD:own< T, Rnote: edOpexpanded from macro 'DEFINE_ncclDevFunc', Pr otoS i611 | m ple RunWorkBatch, algo, proto<,1, 1, COuLL_UnNROLrL>, CoOLL_lUNROLlL>(t>id,( nth)read.s, wrork);u | ^n /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h(:432); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | :670:15 : note: field 'nthreads' will be initialized after field 'tidInBlock' if (tid < subtn) RunWorkColl().,run (ti d,| ^~~~~~~~~~~~~~~~~su /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidbtn, wor)k);, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cppnt:17:h1: rnote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here e 17a | DEdFINEs_nc(clnthrDeevFaunds), tidInBlock(thrc(eAalldRedIuce_dTRExE_.SIMPxLE_)Mi, group(group),nM ax_ u| 8_4 ^~~~~~~~~~~, n cclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (ti 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, Fu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hn | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~nthrea ds(nt hreads| ), tid tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_InBloc k(thre adIdx.x), gr671oup(gr | oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt stepSize(stepShireads(znthreaeds), ti_dInBl ock(thr=ea=dId 0 ? ncclShmem.comm.buffSizes[NCCL_PRx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h | t:i670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565.:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hereb 565 | runTreeUd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.huffSizes[NCCL_PROTO_:SIMP432LE]/:NrCeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 78CL_S:TE PS/note: sizein instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereof(T ) : step432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, , T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncc0l> Fpriums n | ^c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hA:565l:5:l note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hereR e d565 | u cruneTree,Up DownF, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (ti_dALG O_T, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), ()e.ruan(tdid,s su)btn,, w orkt); i | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl()./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAZE; R\ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hunused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp173:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:: 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:: warning: unused variable 'data1' [-Wunused-variable] 75 145 | : ui7nt32_t :data1, f lag1, dwarning: ata2, unused variable 'w' [-Wunused-variable]flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | 75 | uint32 ba_t datrria1,e flrag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclSIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:c75:7: warning: unused variable 'w' [-Wunused-variable]c 75 | l baSrrier_byh_groump(); | ^~~~~~~~~~~~~~~~~~e /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:m15: note: expanded from macro 'barrier_by_group' . 29 | c conh:366:s15:a warning: unused variable 'bid' [-Wunused-variable] t n366 | nconsit eint nbild = tncIclSh medm.chawn nel Id -- wo=rk- >ch annwelLtoo; h | ^~~r k->channelLo; | ^~~ readIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:ulSu175m_bf16: _2, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.hncclFun:cAllR498educe:, Func29PreMul:Sum, h ip_bflwarning: oafield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | t16 , NCCL _ tid(tid), nthrAeLGO_TRaEE, NCdCL_PROsTO(nth_SrIMPeads), LE,w 2) id(tid%WARP_SIZE), warp(t| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | ofll>().lrun();a \ | g ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hT:670:15: note: hfield 'nthreads' will be initialized after field 'tidInBlock' 670 | r teid(tiad((tid%4d),) nth=re=3), group(grados(nthrueads), ptidI), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 n499Block( | thread Idx. stepSizx),e group((grouncclShmp), em.co | ^~~~~~~~~~~~~~~~~ m/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hm:670:.buffSi60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlockzes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, (threadIdx.xw), goroupr(grokup),- | ^~~~~~~~~~~ >sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Op, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnthreads), tidInB:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tilock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunW orkBatsch, algio, przoto, eunroll(>().rsun();t epSi\ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hstepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDeCvL_SFTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTrunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatcheeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollll>(().r)un(.); r\ u| ^ n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBloc(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | k(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Prim 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ itives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ : note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreRedOp, Algo, Proto, COLL_UNROLaLds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, _bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if | (tid(tidt), nthreadis(nthreadds), tidI nBlock(thr().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SI_STMEPSP/sizeoLf(T) :E stepS_ize_) P{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ r | group(group e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hM:63:56: unote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here l63 | Sum_b Primitives, 0, Proto,f16_ 4, ncc0lFuncA>llRedu ce, FupncPreMrulSum,i hip_bms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRfloati16, NnCCL_ALgGO_TRE, algo, proto, unroll>().run()OLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, d(tid)s, nthrueads(nbthreads), tidInBlotck(thrnead, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cppIdx.x), group(gro:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here up), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hhreads(nthreads), tidI:nB670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | lock(threadIdx.22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROT O _x), group(group), | ^~~~~~~~~~~ SIMPLE, 4) | t^id( tid)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h, nthreads(nthre:ads)611, ti:dInB62lock:(thr eadInote: dexpanded from macro 'DEFINE_ncclDevFunc'x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671611 | | ste pS ize( s RunWorkBatepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllRePROTO_tSch, alEgo, proto,] unro/ll>(N).ruCn();C \ L| ^ _STEPS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /stizeoid(tid), nthreads(nthreads), tidInBlockf(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xduce_RING_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>() FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t.run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hl:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(ntE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives< 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(17 warnings generated when compiling for host. ).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_tIn file included from data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.houp(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h15:29:15: note: expanded from macro 'barrier_by_group': 29 | warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ em.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :co366n:s15t: inwarning: tunused variable 'bid' [-Wunused-variable] w = threadIdx.x /366W | A R P _ ScIoZnEs;t \i n t| ^b id = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warpIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, wIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2or: k); In file included from | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | : if (11tid <: suIn file included from btn) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hRunWo:rkCol175l().warning: run(field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]tid, sub tn, work)496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntp), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == C0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | >, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.houp(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0>/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPrCL_PeROTOM_SIuMPLEl]/NCSCL_SuTEPSm/size,of(T) : ste pSizre_) { c | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ c| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hl:254:_90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereb 254f | l oPriamititves8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:<611T, R:edOp62, Fa:nAsy mmnote: etriexpanded from macro 'DEFINE_ncclDevFunc'c, /*Direct=*/0,611 | RunWorkBatch, Praotlo,g 0o> ,pr imps r | ^o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hto,:565 :5u: nnote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested herer o565l | l > r(un)Tr.eerUpuDn(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x | ) , igf r(toidu

().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runToup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShx), group(group), | ^~~~~~~~~~~ mem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk(th:670r:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]e 670a | tidd(tid),I nthredads(ntxhrea.ds), txidInBlo)ck(thread, groupId(x.x)g, roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepgroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCcLlShmem_.comm.SbuffSizTes[NCCEL_PROTOP_SIMPLSE]/NCCL/_STEPS/sizeof(T) :size soft(T) : epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Pr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.himiti:ves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROIn file included from L/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtL>(tid, nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(, algo, proto, unrollnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12S | DEFINE_nchclDevFunmc(AllRedeuce_RING_mSIMPLE_P.reMulSumc_bf8_2, oncclFunmcAllReducme.buffSizes[NCCL_PROTO_SIMPL, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> | RupnWorkrBatchi, alg| o, pr ^oto, u nroll/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h>().run(); \ | : ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h565:670:15: :note: field 'nthreads' will be initialized after field 'tidInBlock' 5670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr: onote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here u565 | p runT)reeUp,Down< T, Re dOp, | Pro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : steto ^~~~~~~~~~~~~~~~~Simple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunW/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:o60: note: field 'group' will be initialized after field 'stepSize'r 670 | k tiCd(tiod), lnthrelads(n, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock((group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ anAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_S:I670:15: Mwarning: initializer order does not match the declaration order [-Wreorder-ctor] P670 | L tid(Etid),_ nthrPeads(rntehreadMsulSum_bf8_4, ), tindInBlocck(thcreadIdlx.x),F grouup(gnroup)c, A| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~llReduce, FuncPreMulSum, rccl_bfloat8, N | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTOCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==d, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, al/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ go, proto, unroll>().run(); \ | ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRingock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx942. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t d | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ata1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelIIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ d - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclD, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PrevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] elLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hn:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173t: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75: 7: warning: unused variable 'w' [-Wunused-variable] w75 | ba= threadIdxrrier_.by_groux/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: In file included from warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp,: flag12, data2,: flagIn file included from 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:: warning: unused variable 'flag2' [-Wunused-variable] 145 | 11 u: int32_In file included from t d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | ata1, fl ag1, da ta2, fl barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hag2; | ^~~~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uin:t32_t d218:15ata1, flagIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from ag2; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11| : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] :173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 145 barri | er_by_g roup(); uint32_t da | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:t29:15: note: expanded from macro 'barrier_by_group' a1, flag1, da29 | constt inta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h w = threadIdx.x/WARP_SIZE; \ | ^ :145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - SIZE; \ w | ^ In file included from ork->c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,hannelLo; | ^~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float,eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here ymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | i 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ f (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), LE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'tid InBloc k(threadIdx.611x), gr | oup(grou p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSRize(stuepSizen_ == 0 W? nccloShmrkBatch, algo, proem.ctomm.buoffSizes,[NCCL_ PROTO_SuIMPLE]/nNCCL_SrTEPSoll>().run()/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | ; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | Pr imit ivest, h/*Dirrecte=*/0a, Prdoto,s 0(> prnims t | h ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hr:eads), tidInBlock(threadIdx.x), group(group565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(), | ^~~~~~~~~~~ tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7: | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:(15: warning: initializer order does not match the declaration order [-Wreorder-ctor] )670 | . tid(trid),u nthrenads(nth(reads)t, tidIinBlock(dthread,Idx.x) subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1:, gr oup(note: group)in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step12 | DEFINE_ncclDevFunc(AllReduce_RING_Size(stepSize_ == 0 ? ncclShmem.comm.buffSiSIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLzesE[NCCL_,PROTO_ SIMPLE2]/NCCL)_STEP S/size of(T) | : step^Size_) { | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254::90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here611 254 | : 62 Primit:ives note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchmmet,ric< NCCLa_MAXl_DEVg_ARIoTY, 1,>, proto, unroll>().run /*Direct=*/0, Proto, 0> prims(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth r| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.he:565a:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here d565 | s ru(nTreneUptDowhn, :COLL60_UNR:OLL> (tidnote: , ntfield 'group' will be initialized after field 'stepSize'hrea ds, wo670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouprk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ ING_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ reMulSum, float, NCCL_ALG/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st:670:e15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p670 | Stid(tiid), nzthree(stepSize_ == 0 ? adns(nthrceads),c tidInlBlock(Sthrhmem.commea.dIdx.x)b, grouup(groffSizes[NCCL_PROTO_SIMPLE]/NCCLup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=/siz*eof(T)/ : ste0pSize_,) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:roto, 0> prims 90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here | 254 | ^ Prim itiv/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hes, ProtoSimple<1, 1, 4>, 4>' requested here t565ric | , /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollo, CO,LL_U NROLPL>(trid,o nthtreaods,, wor k); C | ^O /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hL:432:78L: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here_ UNROLL>().run( 432 | t i if (tid < subtn) RunWorkColl().ru1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hT:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | REE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMn(utild, Ssuubtnm, w,ork ); f| ^ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:o17:1a: note: tin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here ,17 | DEFNINE_ncclCDevCFunLc(A_llRAeduLce_TREE_GSOIM_PLET_PrReMulESuEm_f,32_ 4, NnccClFuCncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TL_RPROETO_SEIMP,LE N, 4C) C| ^ L/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:_611:62P: note: expanded from macro 'DEFINE_ncclDevFunc'R O611 | T ROunW_orkSBatIch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 :| ^ 15/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthre:a611:d62: note: sexpanded from macro 'DEFINE_ncclDevFunc' (611 | n t RunWorhkBartceh, ailgod, pIrotno, Bunrlollo>()c.runk();( \ t | ^ h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hr:670:15e: note: afield 'nthreads' will be initialized after field 'tidInBlock' d670 | tid(tid), nthreads(nthreads), tidIIdxn.x)B, glrouop(gcroukp),( t| ^~~~~~~~~~~~~~~~~ h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:r670:60:e anote: field 'group' will be initialized after field 'stepSize' d 670I | d x.x), group(group), | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro ^~~~~~~~~~~~~~~~~u /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hp:670:)60: ,note: field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, MProto, 0P> primsL | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hE:558:5: ]note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | / runRNing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct Al=go, Pr*oto, /COLL_0UNROL,L>() .run(tPid, srubtn, owork)t; | ^o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12,:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 012 | D>EFINE _prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: nccin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested herelDevF unc( AllReduce_565RI | runTreeUNG_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, fpDowln, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < NG, sNCCL_uPROTOb_SIMtn) RunWorkColl,o algo,, p COLL_UNROLL>().rroto, unroll>().run(); \ u| n(t ^id, su/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hbtn:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthre,a wdorks); )| ^ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp :17:t1: inote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here d17 | IDEFnINEB_nclclDoevFcunkc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce,( thFreaudIdnx.xc), Pgrorupe(grMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE,oup) , 4| ^~~~~~~~~~~~~~~~~ )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 :60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hCL_PROTO_SIMPLE]/NCCL_:670:S15: warning: Tinitializer order does not match the declaration order [-Wreorder-ctor] E670 | P tiSd(t/i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(d)T, nt)hre : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:a56ds(nthreads):, ti dInBnote: lockin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here(thr eadI dx.x), 63gr | Primitivesoup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | , 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5 :step Sizenote: (sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested heretep Size _ 558 | == 0 ? ncclShmem.c omm.runbufRfSizies[NnCCg(tid, nthreads, work); | L_PR ^OTO_ SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hPLE]/NCCL_:STEPS/siz432eof:(T) 78: st:epSi zenote: _) in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:432303:90: | note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | if (tid < subtn) RunWor Primitives().run(tid, subtn, work); | ^i c<1,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp NCCL_MAX_DE:V_AR22ITY>:, /1*Dir:ect= */0,note: Proin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereto, 0> prims | ^22 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: | 565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here D565 | E ruFnTreIeNE_ncclDevFunc(AllRedUpuDownc,l COLRL_UNeROLLd>(tiud, ncthreeads,, wo FuncPreMulrk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: Sum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432: | 611 i:f (t62id :< s ubtnnote: ) Rexpanded from macro 'DEFINE_ncclDevFunc'unWor kCol l, algo, proto, unro, CoOLL_lUNROLlL>()>.run((t)id, .subtrn, wuork)n; | ( ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp):17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17; | DEFI NE_\ncc lD | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'evFu nc(A llReduc670e_TRE | tid(tid)E_,SIMP LE_PnreMutlSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCChLreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nt | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hhreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDe/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hvFunc(AllR:educe670_TR:EE15_SIMP:LE_Pr eMuwarning: lSuminitializer order does not match the declaration order [-Wreorder-ctor]_f32 670 | tid(tid), nthreads(nthre_4a, nccdlFunscAl)lRed,uce, FunctPreMiulSudm, fIloatn, NCBCL_AlLGO_ToREE,c NkCCL_(PROTtO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prhreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEoPto,S un/roll>s().riun()z; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: efield 'nthreads' will be initialized after field 'tidInBlock' o670 | f ti(d(tTid),) nth read:s(nt hreadss), ttidIneBlocpSize_) { k( threadIdx.| x), ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ grou p(gr| oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ht:i63:56d: note: (in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here t63 | i d P)ri, nthreads(nthreads), tidInBmitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | lrocku(thnreaRdIdxi.x)n, ggrou(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buff/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hS:670:15: warning: iinitializer order does not match the declaration order [-Wreorder-ctor] 670 | z tid(etid), nthsre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ads(n[threads),N tid I670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nCBlock(thCL_PROTreadOIdx._SIMPLE]/NCCL_x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLSTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | U PrNimitives(tiFanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:d, nthre5ads, wor:k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | r:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunTureeUnpDownc, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollu().nrunW(tiod, srubtknB, waorkt); c | ^h /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp<:17:c1o: note: ll, in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here t17y | DEF,INE _nrcclDeevdop, algoFun,c(A llReduce_TREpEroto, un_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce,r ollF>()u.ncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLErun(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), 4,) | ^ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:i611:62d: note: expanded from macro 'DEFINE_ncclDevFunc'I n611 | B lRunoWorckBaktch(,a algdo, pIrotdo, xunr.ollx>()).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670, | gr oup (gr oup ), t| ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlocS/sizkeof(T)( : stetpSize_h) { rea | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ d| group(groupIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Pr | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | imit ives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDownSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, /*DdirecOt=*/0p, Pro,to, 0> primAs | ^ l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:g5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here o 565 | , run TreeUPpDownr().rRuedOp,n Prot(oSimptle<1, i1,d, subt COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, fild < osubtan) RtunWo,rkCo ll(T).run(tid, RsubtEn, woErk);, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cppNCCL_PROTO_SIM:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_fPLE3, 4)2 | ^_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h4:611:62:, note: expanded from macro 'DEFINE_ncclDevFunc' 611 | n cRunWcorkBaltcFh, algo, proto, unroll>().run(); \ | un ^cP r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.heMulS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:um15, f:loa t, note: NCCfield 'nthreads' will be initialized after field 'tidInBlock'L_A LGO: _670TRE:670E,15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid) | , tid(tid), nthreads(nthrea dNCCsL_PR)OTO,_SI MPLtE, i4) d | ^I /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hn:611:B62: lnote: expanded from macro 'DEFINE_ncclDevFunc' o611 | c knt( threahds r(neRthaurednadIWs)do, xrti.kxdIn)BlB,oca k(tgthcrrhoea,:, algo, proto, unroll>().ru | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ (| ); \ | ^ 670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670: tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_15 : note: 671 | field 'nthreads' will be initialized after field 'tidInBlock' st670ep | Si ze (s te pStizie_d =(=t i0 d? ncclShmem.comm.buffSizes[NC),C nLth_rePadRs(OntThrOea_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads ( | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupn /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ht:h63:56r: enote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested herea d63 | s ) ,Pr imititveisa, d0,I Pdroxto., x0>) p, group(group), | ^~~~~~~~~~~ rims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f3in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from ier_b/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2y: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11_: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:group();173 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75 :7: warning: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15:unused variable 'w' [-Wunused-variable] 75 | note: expanded from macro 'barrier_by_group' 29 | bar rier const int w = threadIdx._bxy_gro/upW(); A| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:R29:15: Pnote: expanded from macro 'barrier_by_group' 29_ | SIZE ; c \ ons t | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h::174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h29:75:7: :warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gr15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.chIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dataannelId - work->channelLo; | ^~~ 2, f/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cppl:2: In file included from a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hg:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:2145:14: warning: ;unused variable 'data1' [-Wunused-variable] 145 | uint| 32_t ^~~~~data1, fl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from warning: unused variable 'flag1' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data1452, f | la uint3g2; 2| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h_:t data1, fla145:g21: warning: unused variable 'flag1' [-Wunused-variable]1 145 | , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hui:nt32145_t d:ata281, fl:ag1 , dwarning: unused variable 'data2' [-Wunused-variable] 145 | uata2,i flag2n; t32_ | ^~~~~ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h data:1145:28: , fwarning: lunused variable 'data2' [-Wunused-variable] ag1, data2145 | uint32_t data1, flag1, da, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11:: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h29:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h::8015:5:: warning: unused variable 'w' [-Wunused-variable] note: 80 | expanded from macro 'barrier_by_group' bar rier_29by_g | r const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:n2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hs:27:t15: warning: unused variable 'bid' [-Wunused-variable] i 27n | t c In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ bid = ncclShmem.chanonsnt ient lbidI = dncc lSh-mem .chwannoelId - rwko->rk->cchahnneannelLo; | ^~~ lLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclS:218:h15: mwarning: unused variable 'bid' [-Wunused-variable] e 218 | m . cocnsth inat bnid = ncnclSehmelm.chIad - work->channelLo; | ^~~ nnelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: :366:15: warning: unused variable 'bid' [-Wunused-variable] 366warning: unused variable 'bid' [-Wunused-variable] | const in/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ht:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ bid = ncclShmem.channel I366 | d con-st intw biod =r nckclSh-mem>.chcannhelIad -n wornk->elLo; | ^~~ channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiIn file included from ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from //builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from 0/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:,670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] P670 | r tiod(tidt), ntohread,s(nthre ads),0 t> pridIniBms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hlock(thread:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runIdxT.x)r, gerouep(grUouppDown,P CORLL_OUNRTOLL>O(ti_d, SnthIreMadPs, Lwork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl) {( | ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | . group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hru:254:n90: note: (in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here t254 | i d ,Pri misubtn, work);tives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTre | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | eUp^Dow n, | CO LL_ UN RuROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl,_ aUlgNo,R pOroLtoL, >un(r)ol.l>(r).urun(); \ n (t| id ^, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreEFINE_ncclDevads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE,l lReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ metric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtnp)), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_u n671 | W o strepSkiCze(ostelpSilze_< =In file included from Fn, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, =w 0 o? nrcclSkhme)m.c;omm .bu ffS| ize ^s[N CCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp_PROTO_S:IM7PLE:]/N1CCL:_ST EPS/note: sizin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereeof (T ) : step7Siz | e_)D { E | FI ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ NE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREyEm,me trNicC<1C, LNC_CLP_RMAX_ODETV_OAR_ITSY>I, M/*PDiLreEc,t= */02, )Pr o to| , ^0> p/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hrims: 611| ^: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h62::565: 5:note: note: expanded from macro 'DEFINE_ncclDevFunc'in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | 611 | ru nT re eU pDRowun, COLL_UNROLL>(tid, nthreads, work); | ^ edop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hnt:432h:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested herer e432a | d isf (tid( ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLhrEea_dsP),r tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tiIn file included from d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : step/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303z:90:e note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heres [303 | N PriCmitiCvesL, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, CO LL| _UN ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ROL L>( tid| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, 0, 2, 2>::run' requested here _ 432ARITY, 1>, /*Direct=*/0, Proto, 0> prims | | ^ i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hf (t:id565 <: s5u:bt n)note: RunWorkColl().run(tid, subtn, work) ; stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 0, 2, 2>::run' requested here 17 | D,E FCINEO_LdOp, ProtoLL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ncLclD_evUFuNncR(AOllLReLdu>ce,_T RCEOE_LSL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtInM)P LER_PurenWMuolSrumk_fC64_2, onccllFlunc().run(tiWodr,kB atscubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncchA, lallgoR, eprdotuo,c uenr_olTl>R()E.rE_unS()I; M\ P | L ^ E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h_:670P:15:r note: efield 'nthreads' will be initialized after field 'tidInBlock' M u670 | l S utimd(_tif64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGOd), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h.run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), n:t670:h15:r warning: einitializer order does not match the declaration order [-Wreorder-ctor] a d670 | s ( ntithreads), tidInBlock(threadIdxd(.tixd), n)th,re adgs(nthrreaodsu),p t(gidrInoBluocpk(t)hr,eadId x.x ),| ^~~~~~~~~~~ g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | P:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidrimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ (t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hid), nthrea:ds(n432thre:ads)78, ti:d InBnote: lockin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here(t hrea dIdx.x),432 gro | up(g roup ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stiepSifze( step(Sizet_ ==i 0 ?d nccl Shme().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLEpSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | :670: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tiid)f, n thr(eadst(ntihredads ), tidInBlock(threadIdx.x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h), group(grou< psub)tn), Ru nWo rkC| ol:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 l()o.rumn(tmi.d,b suubftn,f woSrk);izes[NCCL_ P| ^ R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cppO:17:1: note: Tin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here O_SIMPLE]/17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFunNCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, PcArllRoedutce,o Fu,ncP reM0ulS>um, doupbrle,i NCmCL_sALG O_T | ^ REE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]um 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | roup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBloIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]ck(threadIdx.x), 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here ad432s(nt | hrea ds), tid InBl ock( thre adIdix.x)f, gr oup((groutp), i | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ d < sub | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ n)671 | RsteupSizne(sWtepoSizre_ k== C0 ? onccllShlmem<.coFmm.nbuf,fSi zesT[NCC,L_P RORedOp, Algo,TO _SPIMPrLE]o/NCCL_STEPS/sizeofto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp(:T) 7: st:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreepSiMzue_)l { S | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u | group(groupm /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h,:254 :90:d note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here o 254u | b l Preimi,tiv es, /*NDirect=*/C0, CProto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, BCOLaL_UtNROcLL>h(, algo, proto, unroll>().run( )nth;rea ds,\ wor k) ; | | ^ ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h if (:tid670 < :subt15n) :Run Wornote: kfield 'nthreads' will be initialized after field 'tidInBlock' 670 | tColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL12/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h8]/NCCL_STE:P670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670S | /sizeof(ui ntid(ttid),6 4_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56:nthreads (nthnote: readin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested heres), tid InBlock63/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | Pr(thireadmIdxitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRingOTO(_SItMPLiE]/dNCC,L _STEnPS/tsizherof(eT) :a stdepSsize,_ work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hi:303:f90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here (303 | t i d Pr imi, /*Direct=*/0, Prot Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:10:1: note: oin instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here, 0 > pri10ms | D| ^E /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hF:I565:N5:E note: _in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here n c565 | c l rDuneTvreFeUupDnownc<(AllReduce_RING_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreTM, uReldOSp,u Pmro,to Sidmpole<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid u, algo, proto, unroRlOlLL>>(().)run.(tridu, nsu(bt)n,; w or\k) ; | | ^ ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp :7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] stepSi 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0>ze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_ST prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims , Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_R| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hPreMulSum_f64_2, nc:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadsclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | (n threRads)u, tindInBlWock(otrkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSi | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < s:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groubutn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested herep), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b u17 | DfEFIfNES_nciclDzevFeuncs(All[RedNuCce_CTRELE_S_IMPPLE_PRreMOulSTum_Of64__4, SnccIlFuMncAlPlLE]/NCCL_SRTedEPucSe,/ FsunicPzreeMuloSufm,( dTou)bl e,: NC CLs_AtLepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | PGrOi_TRmEEit, iNCvCLe_PsRO, /*Direct=*/0, Proto, 0> prims < | t ^y> , /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.halgo:, p565ro:to5,: u nronote: llin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here>( ). run(565); | \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :r670:u15:n note: Tfield 'nthreads' will be initialized after field 'tidInBlock' re e670 | U p tDido(twidn),< nTth,r eRades(dntOhrpea,ds ),P tirdIonBtloSimple<1, 1, COLL_UNROLL>, COLL_UNRoOckL(tLhr>ea(dtIidxd.x,), grnoutp(hgrrouep)a, d | s ^~~~~~~~~~~~~~~~~ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 670:w60: note: field 'group' will be initialized after field 'stepSize'o r670k | ) ; t id (t| id), nthr ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hea:ds432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | (intfhr ea(dst),i tdid In().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Prot/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSio, 0> prims | ze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Prot/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ho, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, un/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduceroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: 15In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75::7: warning: note: expanded from macro 'barrier_by_group' unused variable 'w' [-Wunused-variable] 29 | 75 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ho:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hu:145:14: warning: unused variable 'data1' [-Wunused-variable] p 145 | (uint3); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:2_t29 data1,: flag1, d15at: note: expanded from macro 'barrier_by_group' 29 | a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint 32_t d ata1const , fliant w = thg1, data2, f l const aint w = gthreadIdx2.x/WARP;_SIZE; \ | ^ In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dareadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14In file included from ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dat/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cppa2, flag:2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | ui: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] nt32_t 145 | dat a1, fulag1,i datan2,t flag32; | ^~~~~ 2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21_: warning: unused variable 'flag1' [-Wunused-variable] t 145 | data1, uintf3lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h2:_t da145ta1, :flag135, dat:a2, f lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncc:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int blShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ work->channelLo; | ^~~ id = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h15:218:15:: warning: unused variable 'bid' [-Wunused-variable] 218 | const inote: expanded from macro 'barrier_by_group' 29 | n t bid c= nccloSnsthme/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h int m.cwhan:218: n15: warning: unused variable 'bid' [-Wunused-variable]=e 218 | l Iconsdthre - work->chat int nbid = nncelLo; | ^~~ clShmem.channelId - work->channelLo; | ^~~ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t da/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h con:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flagst 1int b,id = ncdclShmeam.channelId - work->channelLo; | ^~~ ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCLIn file included from _/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:R173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:O warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | T O_SIMPLE]/NCCL_STEPS/sizeof(Ttid)(tid), nthreads:(nthread s), tsidIntepSizBlocke_) (th{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group read/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIdx.x), :grou254:90: p(grounote: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*p), D| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ i671 | stepSize(rstepSizee_ == 0c ? nccltShmem.co=mm.buff*Sizes[N/CCL_PRO0TO_SIMP,L Proto, 0> primsE]/NCCL_STEPS/siz eof | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h(T) : ste:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitiimple<1, 1, COLL_Uves, COLL_UNROLL>(tid, nthreads, workymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < sIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nubtn) RunWthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCLuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ _PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInB:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),:670:15: | warning: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)stepSiz,e(stepS ize_ =nthreads(nthread= s0 ? ncclS)hmem.com,m.buffSizes[NCCL_PROTO tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm._SIbMPLE]/NCCuL_STEPSf/sizeoff(T) : stSepSize_i) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ z | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.he:63:56s: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63[ | PrNimitiveCs,P 0, ProSt/sizeof(T) : o, s0> primts epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hnote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | :254 run:90Ring, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTrUNReOLL>e(tidU, nthpreadsD, worok); w| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hn:432:78:< note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here T432 | , if (tidR < suebtn) RdunWorOkCopll()i.run(mtidple<1, 1, ,C subtOn, worLk); L_UNROLL> | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDe, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RvuFuncn(AWorkColllReducle_RIALGO(_RIN)G, N.CCL_PrROuTO_SnIMP(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:LE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPredMop, aSlgou, pmroto, u,nro ll>r().rcun(c); l\ _| ^ f/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hl:670:15o: note: afield 'nthreads' will be initialized after field 'tidInBlock' t670 | 8 , tidN(tiCd),C nthLrea_ds(AnthLreaGds)O, t_idITnBlRockE(thrEead, NCCL_PROTO_SIMPLE, Idx4.x),) gr oup| (gr^oup ), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h::670:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), nWorkBatch, algo, proto, unroll>().run(); \ | ^ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hidInB:loc670k(t:hre15adI:dx. x),note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), group(grotup)i, d| ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.heads),:670 tidInBlock(t:15h: warning: initializer order does not match the declaration order [-Wreorder-ctor] r670 | tide(tid), adntIdx.x), group(grouphre)ads(nthr,eads), ti d | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671In | Block(t h streadIdepSix.xze(stepSiz),e group(g_ro == 0up) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~? ncclShmem.comm | . tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | b stepSiuffSizze(steespSi[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here m 63 | etric<1>, Primit0iv, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558es:note: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 0 558 | runRi, Prnoto, 0> primg(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid 558:5: , ProtoSimple<2, 2, 4>, 4>' requested here 558 | srunRiung(tiud, ntnhreadWs, woork); r| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hk:432Coll, 1, 2, 4>::run' requested here 432 | R e if (dtidOp, Algo, Proto, COL < sLubtn) _RunWoUrkColNl().run(tid, subtn, w, T,o RedOrp, kAlgo,) Prot;o, COL | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12L_UNROLL>().run(tid, su | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum,b tn, rworkc); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:c22:1:l note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here _22 | fDEFINlE_ncoclDeavFuntc(All8Redu,ce_R ING_SNIMPLCE_PrCeMulLSum__f8_4A, ncLclFunGcAllORedu_ce, RFuncIPreMNulSuGm, r,cc NCCL_PROTO_SIMPLE, 2) l_f loat| 8,^ NC CL_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hALGO_RING:, N611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | | Ru ^nWo rkB/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hatch,u palg(o, pgrotro, ounruolpl>()).r,un (); \ | | ^ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h15: note: field 'nthreads' will be initialized after field 'tidInBlock' : 670670 | : t60id(:tid ), note: nthfield 'group' will be initialized after field 'stepSize're ads (nthre670ads) | , tidInBloc k(t hre adItdx.xi), dgro(tiudp(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: ), nthreadsfield 'group' will be initialized after field 'stepSize' (670n | t h tride(atidd)s, )n, tidInBlock(threadIdx.xt)hr,ea dsg(nrthoreuadps),( tgidrInoBluocpk()th,re ad Id| x. ^~~~~~~~~~~x ), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sub/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from 60/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grmmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | Doup), | ^~~~~~~~~~~ EFINE_ncclDevFunc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] dInBl 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x):670:15,: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_ grPoup(Rgroup)O, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ T | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671O | s_tepSize(SstepSiIze_ ==M 0 ? nPcclShmLem.comEm.b]/NCCL_STEPS/suffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303i:zeof90(T) :: ste pSizenote: _) { in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: 303note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here | Primitives, /*Direct=*/0, PdrOp, FoanAsytmmetroic<1,, NCCL_ MAX_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ D0EV_AR>ITY>, /*Diprect=*r/0, Piroto,m s | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown<0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UT,N RedORp, POrotoSLimple COLL(_UNROtLL>, CiOLdL_UNR,OLL>( tid,nthreads, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if wor k); ( | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ht:432:78i: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here d 432 | UNROLL(>().r)u.run(tidn(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(A, subtnl, workl); | R ^ e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:d1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here u c17 | DEFIeNE_nc_clDevFTunc(ARllRedEuce_TEREE_SIMPLE_PreMulSum__SfIMPLE8_PreM_ulS4um_,f8_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h ncclFuncAllReduc2e, ncc,lFunc AllReFduce:u670:15,:n warning: initializer order does not match the declaration order [-Wreorder-ctor] c FPru670 | n e tciMd(tPulSum, rccl_floid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ at8r,eM ulSNum,C rcCcl_fLloat_8,A NCLGO_TREE, NCCL_PROTO_SIMPLE, 4) CL _ALG| O_^TRE E, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hNCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unro:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ll> ()t.riun(d); (\ t| ^ i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:d670:15): note: ,field 'nthreads' will be initialized after field 'tidInBlock' 670 | n t tihd(treads(nthride), anthdreasds()nth,rea ds)t, tiidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlo670:60c: note: kfield 'group' will be initialized after field 'stepSize' (670 | t htid(rtide), anthdreaIdds(nxthr.eadxs), )t,idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlo:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShck(mthreeadIdmx.x)., grocup(goroupm),m | ^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMul/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hSum, rccl_float8, NCCL_ALGO_TREE:670,:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]NCCL_PROTO_SIM 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | roupt(grouip), d| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_( 671 | t sitepSidze(st)epSiz,e_ == 0 ? nncclShmtem.cohmm.rbuffSeizes[aNds(nthreads), tidInBlock(threadIdx.CCL_xPROTO)_SIMPLE]/NCCL_S,TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: gr254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereou p(gr oup), | ^~~~~~~~~~~~~~~~~ 254/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670: | 60: note: field 'group' will be initialized after field 'stepSize' 670 | t Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0>/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(R).runO(tid, subtn, woTrk); O | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp_:17:1S: note: Iin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here MPLE]/NCCL_STE17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ metric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nhmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' _PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx906. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const inwarning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->chaIn file included from _SIZE; \ | ^ nnelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int :366:15: warning: unused variable 'bid' [-Wunused-variable] w 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const i 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nt bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hannelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrea), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmemPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDev/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:Func(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId -In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.chanIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dnelId -a work-t>channelaLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, fl:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3:2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:2929 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp :2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ht:27:15: warning: hunused variable 'bid' [-Wunused-variable] 27 | r conste int biad = nccdlShmem.cIhanneldId - woxrk->.x/chaWnnelARP_SLo; IZE; | ^~~ \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, \ | ^ d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channaeta2, fllag2; I| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145d:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | - uint32 _t dataw1, flago1, datra2, flkag2; -| ^~~~~ >channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 366 | const int bid = ncclShIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;mem.channelId - work->channelLo; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - worIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId -k->channelLo; | ^~~ work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_P tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(epSiz)e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direc(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uit=n*/0, tProt8o, 0> _primst | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h,:565: 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereN 565 | C ruCL_ALGO_TREE, nTNreeUpDCown, COLL_UNROLL>(tid, nthreads, work)P;LE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h| :611:62: ^ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h RunWo:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | rk Bat ch< col l , t y, iredfop< ty>(, talgio, dpro < tso, ubtn)unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthre RaunWodrksCol(lo().crunk(ti(d, tsubhtn,r weorka); d | ^I /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cppdx.x), :7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRegdrouup(gcrouep),_ | T ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hR:E670:60E: note: field 'group' will be initialized after field 'stepSize'_ S670I | M tPid(LtidE), _ntPreMulSum_u8_2, ncclFunchAreadls(nlthrReades),d tuidIncBloeck(,th reFaduncIdx.x), group(group), | ^~~~~~~~~~~ PreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DE670 | F tid(tIid), nthrNeads(ntEhreads), _tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizencclDoevFunc(fAllRed(uce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAT) : lstepSizle_) { R | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~educe, | F group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.huncPreMulSum, uint:63:568: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here _63 | t Primi, tNCCL_ALGO_RING, NCCL_PROTO_SIMPLivEes, 0, Pr | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'ot o 611 | RunWorkBatch<, 0c> proims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, ll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:nt15hrea:ds, worknote: ); field 'nthreads' will be initialized after field 'tidInBlock'| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h : 432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here670 | tid(tid), nthr 432e | a d if s(tid( < snubtthreads), tidInBlonc) RuknWor(kCotll(). ru670 | tid(n(ttid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFiud),n ntchre(ads(nthreads), tidInBlock(threadIdx.x), group(AllgRedruceo_RINuG_SpI), | ^~~~~~~~~~~ MPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Pri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ mitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tiads), td%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchIn file included from , a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpplgo, proto, unroll>().run():2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h;:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670\:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | | tid ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670(:tid)15, nth: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | treads(nthreads), tidInBlock(threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), groidu(tidp), n(thregads(rnthroeads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60u:p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: | field 'group' will be initialized after field 'stepSize' tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize670(ste | pSiz e_ == 0 ? nc clShtmem.icommd.buff(tid), nthreads(nthreads), tiSdizesI[NCCnL_PROBTO_SlIMPLoE]/NCcCL_SkTEPS(/sizteohf(readIdx.x), groupT) :( stegpSizre_) {o | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670: 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hDEFINE_ncclDevFunc(AllReduce_RING_SI:MPLE_PreMulSum_u8_2, ncclFuncAllRe670d:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] u 670 | c tied(tid),, nt h ^~~~~~~~~~~ FruncPreMulSum,eads(n threadus), tint8_t, NCCL_ALGO_RidInBIlock(tNhreadIGdx.x), ,g NCCL_PROTO_SIMPLE,roup( group),2 | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ )| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n/tsizehof(Tr) : setepSaize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hd:303s:90: (note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here n303 | t h Prreimads), tidInBlock(threadIdx.x)iti,ves< T, gRedrOp, oFanAusymmpetr(ic<1g, NCrCoup), L_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' : 670 | 565 :ti5d: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUp(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ */0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.homm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | :303 :90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | tid(tid), n Pritmitivehreads, u/*Direp),ct=*/0, Proto | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM,P 0> LprfSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ims E | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h]:/565:5: Nnote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here C565 | runTreeUCpDown, COLiL_UNROzLL>(te_) { id, nt| hr ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: eads,note: work)in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h 254 | : 432:78: Pnote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here rimitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5n:WorkCo ll, ProtoSimple<1, 1, 4>, 4>' requested hereedOp, Algo, Proto, COLL_565 | runTreeUpDowno().runS(tid, isubtn, mwork); p | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cppl:7:1e: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here <7 | DEF1INE_,ncclDe vFunc(A1llRedu,ce_TRE E_SIMPLCE_PreMOulSum_Lu8_2L,_UNROLL>, COLL_ UncclFuNncAllReduce, FuncPreROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h,:611 :62: Cnote: expanded from macro 'DEFINE_ncclDevFunc' OLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:611 | 17 R:unWo1rkBa:tch< collnote: , tyin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here, red op, algo,17 pro | to, DunroEll>(F).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PrthreeadsM(nthureadls), StidIunBlomck(t_hreaudIdx8.x),_ g4, ncclFuncAllRerodup(guroupc), e| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, :670:FuncPreMulSum, u60i: note: nfield 'group' will be initialized after field 'stepSize' t6708 | _ ttid(t,id) , nNCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ etric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WAR | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ P_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCo:l670l().rutn(tid, shubtn, wrork); | e ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hadId:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEF:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buxf.x),f group(gSroup), i | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_z 671 | e stepSisze(st[epSize_N == 0 ?C ncclShCmem.comLm.buffS_izes[PINE_ncRcNlDevFOuCnc(AllTCReduceO_LRING_SI_MPLE_PrePMulSumR_u8_2, OncclFunTcAllRedOuce, Fu_ncPreMSulSum, uI iM670 | n tPid(ttid),L 8nthreEa_ds(nt]htreads)/,, tidNI nBlocCkN(threCaCdIdL_STEPS/sizeoCLf_ALGO_R(ING, NxT.x)C, g)rouCp(g rouLp: stepSize_PROTO__SIMP)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :90: Lnote: E, in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62254: note: expanded from macro 'DEFINE_ncclDevFunc' | 611 , | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | R suPtenpSiWze(rostirepSmkize_iB ta== 0t ? nccclhShmei>, al,go, proto/, un*rollD>().run(); \i rec| ^t /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h=*/0, Proto:670:,15: note: field 'nthreads' will be initialized after field 'tidInBlock' ze0 of>(T) : s670tpep | Sirz e_)i ms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565: t5id( { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | rtuindT),r entehrUeapdsD(nothwreads),n ti, COLL_UNRO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hreads(:nt432h:reads), 78ti:dI nBnote: loin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereck (t hreadId432x. | x if (tid < subtn) R),u gnroWupo(grrokupC),o l| ^~~~~~~~~~~l ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBluloSum_u8c_4, nkcclFun(cAllRedtuce, FhuncPreMrulSume, uinta8_t, NdCCL_ALIGO_TREEd, NCCLx_P.x), gROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : ste\ | p ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90:670::15: note: field 'nthreads' will be initialized after field 'tidInBlock' note: 670 | in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ti d(ti d), nthrea254ds(nt | hread s), t idInB lock (thre adIdx .x)Primitiv, group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)es<,T, Re dOp,g FanArsymmoetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here up(gr565oup | ), | ^~~~~~~~~~~ runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hclShm:670:e15: warning: initializer order does not match the declaration order [-Wreorder-ctor] m670.comm.buffSizes | [tid(tid),N nCCL_PROTO_SIMPLthreEads(nth]rea/NCCds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direhcmem.cotmm.buff=Sizes[*NCCL_PR/OTO_SI0MPLE]/N,CCL_ST EPS/siPzeof(T)r oto,: step0> prSiize_) ms { | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | PrimitivesUpDow,n, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ho, Prot:o, C432OLL_U:NROL78L>(:).ru n(tinote: d,in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here sub tn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp432 | :17:1 : note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DiEFINf (tid < subEt_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllRend) RuunWocrkCeoll<,Fn, T, RFedOpu, Algno, PcrotoP, COrLL_UeNROLML>()u.run(ltid,S subutn, m, uint8_t, NCCL_Awork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ DEFILNGO_ETREE_, NnCCLc_PRcOTOl_SIDMPLEe, 4vFunc(All) R | ^educe_TREE_SIMPLE_PreM /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hu:l611:62S: note: uexpanded from macro 'DEFINE_ncclDevFunc' m611 | _ u Ru8n_Wor4kBa,tch n, algo, proto,cAl unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670lR:ed15uce:, F uncnote: Prefield 'nthreads' will be initialized after field 'tidInBlock'Mul Sum , uint8670_t, | NCC L_A LGO _TR EE, tNCCiL_PdROT(O_StIMPiLEd), nthrea,d 4)s | (^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hn:t611:hreads), ti62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAll/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 218 | coIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nst int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data\ | ^ 1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bar/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ rier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h670 | tid(tid:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid),), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx. 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ le<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_P1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloarkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncIn file included from clShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp145::282:: In file included from warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hunused variable 'data2' [-Wunused-variable]: 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: 175145: | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h : 80u:i5n:t 3warning: 2_unused variable 'w' [-Wunused-variable]t data1, flag1, d a80t | a 2 , fblaargr2i;e r _| b ^~~~~y _gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.ho:u145p:(35): ;warning: unused variable 'flag2' [-Wunused-variable] | ^~~~~~~~~~~~~~~~~~ 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h : 29u:i15n:t 3note: 2expanded from macro 'barrier_by_group'_ t data 129, | f l a gc1o,n sdta tian2t, wf l=a gt2h;r e a| d ^~~~~I dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDoIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ wn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:d), nthre15ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, p if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, da/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from nt w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ onst int w = threadIdx.x/WARP_SIn file included from IZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h\ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work35: warning: unused variable 'flag2' [-Wunused-variable] 145 | - uin>t32_t data1, flag1, data2, flag2; | ^~~~~ channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h \ | ^ :In file included from 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: atunused variable 'w' [-Wunused-variable]a1, fla g1, dat a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h75:145:21: | warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3 2_t dat a1, fl ag1,b data2a, flag2r; | ^~~~~r /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:i28: warning: unused variable 'data2' [-Wunused-variable] e145 | r ui_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:n15t32_t: data1, flag1note: , dataexpanded from macro 'barrier_by_group'2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.chaIn file included from nnelId /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | - work->channelLo; | ^~~ In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, wo:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIn file included from Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PIn file included from ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498,:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] C496 | O tid(Ltid),L nthr_eads(UnthreNads),R wid(tOidLL>(tid%WARP,_SIZ nthreads, work); | ^ E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | :432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCo stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)l l497 | < wFarpInnBloc,k(th readTIdx.,x/WA RP_SRIZE)e, | d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | O warp(tid/WARP_SIZE 498p, | flagThread( Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFu(ntid%c4)==3), gro(up(gAroup)l, l| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ R| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 e499 | d sutepScizee(ncc_lShmeTm.cRomm.EbuffESizes[NCCL_PROT_O_LLS128]I/NCCLM_STEPPS/sLE_Prodi_zeoff(uin16_2, ncclFuncAllReduce, FuncProd, ht6a4_tl)) f{ ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hCC:421L:9:_ note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested hereA L421 | G O _ T prims(tid, nthreaREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | ds, tr ee- >do wn,R trunWorkBatch, algo, proto, unroleel->d>own(, w)ork.->srendbuuffn, w(ork)->r;ecv buf\f, w ork- >re| dOp ^Ar/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSg )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070::5: 670note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here : 107015 | hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, ha: runnote: Trfield 'nthreads' will be initialized after field 'tidInBlock'eeS pli t(ntidB, nlthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hock(threadIdlf, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grox.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nutp), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nt:432h:78:r note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested heree a432 | d s i)f (,tid < tsuibdtn)I RunnWoBrkColl()o.ruun(p(group),t id ,| su ^~~~~~~~~~~btn , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFIIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tidIn file included from (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, x),0 group>(group ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ r671 | i stepSmize(stepsSize_ == 0 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S 565 | runTreeUpDown, COLL_UNROLL>(tid, TnEPS/sizteof(hT) : srtepSizee_) { a| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupd /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hs:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl,, /*Dir ect=*/A0, Prlgooto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | ru, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_nT2reeUp,Downa, COlLLf_UNR,OLL> (tidN, ntChreaCds, wLork)_; | ^ A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hL:432G:78: Onote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here _432 | T R if E(tidE < s,ubtn ) RuNnWoCCL_PROrkColl()TO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611611p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~: 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, alS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runT:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPreeSUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollsizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(565 | runTreeUpDow().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7nthreads), tidInB | DEFINlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | E_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGn, CCOLL_CUNROLLL>(_tid, PnthrReadsO, woTrk);O | ^_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hSIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(t:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO:670:60: note: field 'group' will be initialized after field 'stepSize' _670 | T Rtid(Etid)E, nt,hrea ds(nNthreCads),C tiL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitivesp, /*Dire(ct=*/0, Pgroto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | roup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthread s if (,tid < subtn) tRunWrorkCollego, Prdoto,own, tree-> COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAlldown, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tiRedduce, ,Func Prodn, haltf,h NCCrL_ALGeO_TRaEE, dNCCLs_PRO,TO_SI MPLEw, 2)o | ^r /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:k611:62:) note: expanded from macro 'DEFINE_ncclDevFunc' ;611 | R unWo| rkBa ^tch< coll/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h, ty, redop<:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid ty>, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here t5hre | ads(DnthEreaFds)I, tNidIEnBl_ockn(thcreacdIdxl.x)D, gerouvp(gFrouup)n, c| ^~~~~~~~~~~~~~~~~ (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:A670:60l: note: field 'group' will be initialized after field 'stepSize'l R670e | d utidc(tide), _nthTreaRdEE_LL128_Prod_f16_2, ncclFuncAllReduce, Funs(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> primIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stes | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrpSize(ncclShmem.comm.buffSizes[NCCL_PROTeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | f:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizelagT_hread( (tid%=4)==3=), gr oup(0group ),? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_ | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h pri:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrSeTEPSa/sizedof(T) s: ste)pSize,_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | t group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hi:d63:56: note: Iin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63n | BPrimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing-nthr(eadsSt, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pliti, nthredads-n,threadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuffloc,k(thr eadId x.x),| grou ^p( group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplitRO n(TthrteaOds,i_ wordkS); ,I | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h nMP:432:tL78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here E 432 | ] / if (Ntid < CsubtnC) RunLWorkC_oll().erupSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56n(tid, sub:tn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1:hreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hllReduce_RIN:G_S558IMP:LE_5Pro:d_f 16_note: 2,in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here nc clFu ncAll558Red | u ce, Fu ncP rodr, huanRing(tid, nthreads, work); | ^l f, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hNCCL_ALGO:_RIN432G,: NC78CL_:PROTO_SIM PLEnote: , 2in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62432: note: | expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru nWo rkB atcih, algo, proto, unroid < subtn) RunWorkColl().run(tilld>(),.ru n()s; \ u | ^b /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht:670n:15:, note: field 'nthreads' will be initialized after field 'tidInBlock' w670 | o rtikd(t)id);, n thr ead| s(n ^th re/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cppads), tidInB:loc12k(t:hre1adI:dx. x),note: group(gin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hererou p), 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Pro| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xd_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru)n, gWrooupr(grkoupB), a | ^~~~~~~~~~~t ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here , proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_ stepSize(stepSize_ ==f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prot:670:o15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ,670 | tid(t id), nuthreadns(nthrreads)o, tidInlBlock(lthre>adIdx.x(), g).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(roup(gtroup)i, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ )671 | ,stepSiz e(stepnSize_ threads(nthreads), tidInBlock(threadId== x0 ? n.cclShmxem.co)mm.buf,fSizes [NCCL_gPROTO_rS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.houp(grIMPLE]/:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloup)o, | c ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk:670:60(: note: field 'group' will be initialized after field 'stepSize't h670 | r tied(tiad), dnthreIads(dnthrxeads.NC)xCL_,)STE ,PS/s tizegiofrd(ToI) :n steBpSizle_) {o | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c | group(groupk /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h(:254:t90: note: hin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here r254 | e aPrimditiuIpv(gderoxsup).<, xT| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_, group(group), | ^~~~~~~~~~~ RedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:anA symmnote: etric, 0, 2, 4>::run' requested hereX_DE V_A RITY, 1>,432 /*D | ire ct= */0, Pro to, 0> p if (tid < subtn) RunWorkColl, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, worOLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGOk_); | T ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hR:E432:78:E note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here, 432 | N iCf (tCid < Lsubt_n) RPunWRorkOColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SI 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threaMdPLEI_Prdod_xf16_.4, xnc)c,lFu ncAgllRerducoe, uFunpcPr(od, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_nc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizesclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ty>, algo, proto, unroll>().run(); \ | ^ ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid( | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hreadIdx.x), group(group),:670:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste | tid(tid), nthreapSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLds(nEthread]s), ti/dInBlocNk(threCadIdx.Cx), grLoup(gr_oup), S| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_T 671 | E stePpSiS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:ze(st63epSize:_ == 056 ? nc:clShme m.commnote: .buffSin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hereizes[N CCL_ 63 | Primitives, 0, Proto, 0_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereN 254 | R OPrimitLives, Fa(nAsymmtetric, /*note: Directin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here=*/0, Proto, 0> prims 432 | if (tid < subtn) Ru| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(>, ACOLLl_UNRlOLL>R(tide, ntdhreauds, cworke); _| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: Rnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here I432 | N Gif (_tid S< suIbtn)M RunPWorkLCollE, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDownL().rGun(tOid,_ sRubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here ING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, aimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthre17a | DEdFINEs_ncc)lDev,Func (AlltReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, ^~~~~~~~~~~~~~~~~al go, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hproto, unroll>().ru:n();670 \ :| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h60:670::15: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threnote: field 'nthreads' will be initialized after field 'tidInBlock' a 670 | d tIid(tdid),x nth.readxs(nt)hread,s), tidIgnBlorck(tohreaudIdxp.x),( grougp(grroup)o, up), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.headIdx.x), :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ea| ds, wor warp(tid/WARP_SIZEk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h 498 | f:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | l if (taid < sugbtn) RunTWorkCholl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRedead((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[N 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, algo, proto, unroll>().run();anSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: \ | note: ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hin instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid( tid), nth432read | s(nt hrea ds), tidI nBlock(thre adId x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), griof (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ up(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrenote: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' ads(nthreads), tidInBlock( 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, ProIn file included from to/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx., 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollf().runS(tid, isubtn,z work)e; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpps:22[:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereN 22 | DCEFINE_CncclDLevFunc(_AllPReduce_RRING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMOTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h0:611:>62: note: expanded from macro 'DEFINE_ncclDevFunc' 611p | r RuniWorkBmastch< coll , ty| , re ^do p, algo, pro:to, 558unr:oll>5().r:un() ; \ note: | ^ in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 558670 | | tid( tid) , nt runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl670:60:( note: field 'group' will be initialized after field 'stepSize' ) .670 | r tiud(tnid),( nthtreadis(nthdread,s) , tisdInBulock(bthretadIdnx.x),, gr oup(wgroupo), r| ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tiBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h :75145 | : 14 : warning: unused variable 'data1' [-Wunused-variable]b arrier_by_ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | cons uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.chanIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: : In file included from field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx906. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from 175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channe/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const intonst int bi d = ncclShmbem.channelIid - work->channelLo; | ^~~ d = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'bid' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11 218 | : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const const int bid = ncclShmem.channelId - work->channelLo; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFuIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)nc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpphreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hst int w = threadI:dx.x/218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid WARP_SIZE; \ | ^ = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(In file included from ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h::145:14: warning: unused variable 'data1' [-Wunused-variable]29 145 | : uint1532_t :data1, note: expanded from macro 'barrier_by_group' 29 | con flag1, data2s, flaIn file included from t int wg2 ; | ^~~~~ =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h threadIdx.x/WARP_SIZ:145:E21: warning: unused variable 'flag1' [-Wunused-variable] ;145 | uin\t32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp: | ^ data1, flag1,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp: 2: In file included from d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75t:7: warning: unused variable 'w' [-Wunused-variable] a 75 | 2 ba,rrier_ by_group(f); | ^~~~~~~~~~~~~~~~~~l /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:a29:15:g note: expanded from macro 'barrier_by_group' 229 | ; c | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145onst int w = threadId:366:x15: .warning: unused variable 'bid' [-Wunused-variable] 366 | x co/nst inWt bid A= ncclRShmem.cPhannel_Id - w2S: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:oI11: In file included from r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hZ:174: k/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:E75:7:- ;warning: unused variable 'w' [-Wunused-variable] > 75 | c\ bharriera_by_gnroup()n; | ^~~~~~~~~~~~~~~~~~ e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15lLo; | ^~~ : note: expanded from macro 'barrier_by_group' 29 | const int w = :28:t warning: unused variable 'data2' [-Wunused-variable] h145 | ruint32e_t daata1, dflag1I, datad2, flaxg2;.x/WARP_S I| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hZ | ^ :145E:35: warning: ;unused variable 'flag2' [-Wunused-variable] 145 | \uint 3 | ^ 2_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | In file included from ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hby_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: 218 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h co:nst 27int :bid 15= nc:clSh warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmemm.cheannemlId .- wocrk->chhananelLno; n| ^~~ elId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hier_by:_group()11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:; 173 | ^~~~~~~~~~~~~~~~~~ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29 :15: note: expanded from macro 'barrier_by_group' 29 | co nbsarrier_by_grt inIn file included from to w = thure/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ p(); adIdx.x/WARP | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h::11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]warning: 75 | unused variable 'w' [-Wunused-variable] bar rier_by_g roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h75:29:15: note: expanded from macro 'barrier_by_group' | 29 | const in t w = th rea barrier_by_group(); dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, d; | ^~~~~ a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:t28: warning: unused variable 'data2' [-Wunused-variable]a 145 | 2 uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dat/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ a2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_| ^~~ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/W/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cppA:2: In file included from R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:P11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h_:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:S80:5:I warning: unused variable 'w' [-Wunused-variable] Z 80E | ;barr ier_\by_g rou p| (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29: ^15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - bid = ncclShmem.channelId - work->channelLo; | ^~~ work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const in:366:15:t warning: unused variable 'bid' [-Wunused-variable] 366bid = ncclShmem.channelId | co-nst i nt biwd = nocclShrmem.ckhanne-lId -> workc->chahnnelLao;:366n :15: n warning: unused variable 'bid' [-Wunused-variable] e| 366 | l ^~~ cL onsto; | ^~~ int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(tIn file included from hreadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthre flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:ad9s), t:idInB locnote: k(thrin instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested hereeadId x.x), group(g421roup), | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste pSize (stepS ize _ == 0 ? ncclprims(tidShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC, ntChreadsL, t_ree->Sdown,T tree-E>down,P wS/sizeof(T) : stepSizeo_rk->)sendbu ff, w{ork-> recvb uff, | w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hork->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, n:254:t90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereh r254 | e a Pridmitisves<,T, R edOpw, FaonAsrymmetkric), /*D| ir ^ect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hr:432:78: unote: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here n432 | T rif (teid ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DOLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670FIN | E_ ncc lDev Fu nc(tAlliRedduce(_TRtEiE_SdIMP)LE_,Pro d_fn8_2t, nhcclrFunecAads(nthreads), tidInBlock(threadIlldRedxuce., Fxunc)Pro,d, rccgl_frloaot8,u NCpCL_(ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hg:rou611p):62: , note: | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~expanded from macro 'DEFINE_ncclDevFunc' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 611 st | RunWorkBatch, algo, prot:o, u254nro:ll90>():.ru n()note: ; \in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15:254 note: field 'nthreads' will be initialized after field 'tidInBlock' | 670 | tid (ti d ), Pnthrreaids(nmthrieadts),ives,t /h*Drireecat=d*/s0,) Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTree,U tpidDInoBlwocnk(p(g,ro upC),O L| ^~~~~~~~~~~L_ UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | O_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ st/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),epSize(st epSizeg_ == 0r ? ncoclShmeum.commp.buffS(izes[NgCCL_PRrOTouO_Sp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ IMPLE]/N671CCL_ST | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EP stepSS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COV_ARLITY>, L/*Direc_t=*/0, UProto, N0> priRms | ^ O/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565LL>(tid, :5: note: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here threads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432: 78565 | : runnote: Treein instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested hereUpDo wn, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ro | toSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Al ifl (tiRd < esubtn)d RunuWorkcColle()_.Prod_f8_2, ncclFuncrun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncAllRcelFuncAlduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, allRegducoe, ,Func Propd, rrccoto, unroll_flloa>t8, (NCC)L_.ALrGO_uTREnE, (NCC)L_P;ROT O_SI\MP LE, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 2 ) | ^670 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ti ^d(t id)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, nthre:ads670(nt:hre15ads:), tinote: dInBfield 'nthreads' will be initialized after field 'tidInBlock'lo ck( 670 | thr ead Idxt.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), id(| tid ^~~~~~~~~~~), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreadasdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:In file included from 1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hf8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cppL_AL:G2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29:O_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,R alPgo,_ prSotoI, 670Zu | En )r t,oid l(t lid| >), ~~~~~~~~~~~~~~~~~~( n )th .re| ar stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)dsu ( nn(t497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread(()t; \i | d ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h%:6704:15:) note: field 'nthreads' will be initialized after field 'tidInBlock'= = 6703 | ) t,id( tidg), nrthroeadus(npthr(eadgs),r tidInBlock(threadIdx.x), group(group), | hr ^~~~~~~~~~~~~~~~~ead s),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h tidI:nBl670ock:(th60re:ad Idxnote: .x), grofield 'group' will be initialized after field 'stepSize'up( gr oup), 670| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | sttepSiized(st(epStize_i ==d 0 )? n,ccl Shmnem.ctommh.burffSeizeas[NdCCLs_(PROnTO_tSIMhPLEr]/NCeCL_aSTEdPS/ssio)zupe), o | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ f | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3( T499 | ) stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEP,S ti/dInsBloick(zthreeadoIdxf.x)(, guroup(:g sirtenoupSipze_t)) 6,{ 4 | _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | t | ^~~~~~~~~~~) group(group )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &t:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize runTreeUpDownup, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth);_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | ru n Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here TreeSplit, COLL_UNROLL>(tid, nthreads, work); | 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hPro:t432o:78L:L 1note: 2in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here8 , C432O | L L _ U8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ N R OL ifL >((ttiidd ,< nstuhbretand)s ,R uwnoWrokr)k;C o l| l ^< Fn,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h T:,432 :R78e:d Onote: pin instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here, Al432g | o , P r o tiof, (COLtLi_dU N()t.nr)u nR(utniWdo,r ksCuobltln<,F nw,ork) ;T , | ^R e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cppdOp:,7 :A1l:g onote: ,in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here P r7o | tDoE,F ICNOEL_Ln_cUcNlRDOeLvLF>u(n)c.(rAulnl(Rteiddu,c es_uTbRtEnE,_ SwIoMrPkL)E;_ P r| oIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 0, 1, 2>::run' requested herec l Fu5 | nDcEAFlIlNRE_endcuccleD,e vFFuunnccP(rAoldl,R erdcuclc_ef_lToRaEtE8_,L LN1C2C8L__PArLoGdO__fT8R_E2E,, nNcCcClLF_uPnRcOATllOR_SeId>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) MucPeL,E , F2u)n c P| RrunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro^ o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hd,: 611r:c62c:l _note: fexpanded from macro 'DEFINE_ncclDevFunc'l o a611t | 8 , N CRCuLn_WAoLrGkOB_aTtRcEhE<,c oNlClC,L _tPyR,O TrOe_dLoLp1<2t8y,> up), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ,2 )a l | g^o ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h p:r611o:t62o:, note: uexpanded from macro 'DEFINE_ncclDevFunc'n r ol611l | > ( ) . rRuunn(W)o;r k\B a t| c ^h ,t iadl(gtoi,d )p,r onttoh,r eunraodlsl>((n)t.hrreuand()s;) , \t i | d ^I nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0,ple<1, 1, COLL_UNROLL>, COLL_UN RPOLL>(tidroto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2C: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: LIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h_:498:29: Pwarning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tiRd(tid)O, nthTreads(Onthread_s), wiSd(tid%WIARP_SIZME), waPrp(tidL/WARPE_SIZE)]/NCCL_S, | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThreaTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpd((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL17 warnings generated when compiling for gfx1030. _PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives,n COLLS_UNRyOLL>m(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, O0, Pproto,, 0> primsA | ^l /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:g1062:5: onote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here ,1062 | PrunRirngL(ti_d, ntUhNROLL>().rreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllRep,d Alugo,c Proeto,, CO LL_FUNRuOLL>().runn(tcid,P srubotn,d wo,rk) ; | r ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:ccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | 10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128,^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk:(670:15: warning: threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671initializer order does not match the declaration order [-Wreorder-ctor] | 670 | ti d(ti d), n threasds(nthtreads)e, ptiSize(stepSize_ == 0 ? nccldInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_{ 671 | stepS| ize( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~step Siz e_ ==| 0 ? group(groupncclS hmem./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hcomm.buffSizes[N:CCL_63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives: ste,pSiz e_) {0 | , Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, /*LDirectL=*/0>, Prot(o, 0>/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h) prims. | ^ r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereu 565 | n r(unTreteUpDoiwn, ,COLL_ UNROLsL>(ubtn, wortid, nthreads, work); | ^ SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h,:432: 78:r note: cin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here c432 | l _ fif l(tioat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2d) < sub tn)| Ru^nW ork/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hColl (). ru n(Rtidu, snubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrePaLEds(nthreads), tidInBlock(t_Phrord_ef8a_2d, IncdclxFu.ncxAl)lR,ed ucge,r FounucPpro(d,g rrccol_floaut8p, )N,C CL _ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h| :611 ^~~~~~~~~~~~~~~~~:62 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnote: expanded from macro 'DEFINE_ncclDevFunc' :611670 | : RunWorkBatch, algo, proto, unroll>().run(); \ 60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nt h| ^r /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hea:670:15: note: dfield 'nthreads' will be initialized after field 'tidInBlock' s )670 | , ttiid(dtiId)n, Bntlhrock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threa Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hrun(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(ti:670:15d: warning: initializer order does not match the declaration order [-Wreorder-ctor] ) 670 | , tid(tid) , ntnhreadts(nthhreards),e tidaInBlodck(tshrea(dInthreads), tdxi.x),d groIup(gnroBlock(threadIdx.x), up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufgrofup(gSroupi), z| ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hzeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Pr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SI565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ WorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8| ^~~~~~~~~~~ _4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNRO L tid(tLid), >nthre(ads(nthreadstid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if ),( tidtInBlocik(thrdead Idx<.x ), grsoup(gruoup),b | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ n671 | ) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCOLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(t(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group17 warnings generated when compiling for gfx1201. (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), In file included from t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Pr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thre Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | .tcomm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T algo, proto, unroll>().run) : stepSize_)(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, Fun17 warnings generated when compiling for host. cProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h), group(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nth/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)reads(nthreads) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:121: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:: 14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: In file included from warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :uint32_t 12y, head, mantissa; | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byt dat_a1, flagg1, data2r, flag2; o | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:u145:21: warning: unused variable 'flag1' [-Wunused-variable]p 145 | ( uint32_)t data1,; flag1, d ata2, fl ag2; | ^~~~~| /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28 ^~~~~~~~~~~~~~~~~~: warning: unused variable 'data2' [-Wunused-variable] 145 | uin/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.ht32_t data:29:15: note: expanded from macro 'barrier_by_group' 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint329 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, da/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, In file included from flag1, d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2a: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75a:7: warning: unused variable 'w' [-Wunused-variable] 275 | , barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();hannelLo; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const intIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] bid = ncclShmem.c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barhannerlId -i worek->chrannel_Lo; b| ^~~ y_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bidIn file included from = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218:366:15: | warning: unused variable 'bid' [-Wunused-variable] 366 const i | n contst inbt bid i= ncclSdhmem.c hannel=Id - w ork->cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ncclhaSnnelLoh; | ^~~ mem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grouIn file included from p/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tiIn file included from d, n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cppt:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hh:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]e 670 | a ds, work); ti d(tid),| n ^ thr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: eadsnote: (nthrin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) Readus), tidnInBlockW(thorkColl_ == 0 ? ncc().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReducelSh_memT.comm.RbuffSizes[NCCEL_PROETO_SIMP_LE]/NSCCL_STEIPS/siMzeof(TP) : sLtepSEize_) _{ Prod_ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ u| 32 group(group_2, ncclFuncAl l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hReduce, FuncProd,:254: uint32_t, NCC90: Lnote: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here _254 | A L PrimGitiOves, /*Direct=*/0,L_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat Procto, h0>< pricms | o ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hll, ty, redop, algo, proto, un:565:r5: note: oin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here l565 | l runTreeUpDown()t.oSimprle<1u, 1, nCOLL(_UNR)OLL; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' >, COLL_UN670ROLL | > tid(tid), n(tidt, nhthrreades, aworkd); s | ^( n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ht:432:78h: readnote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorks), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' Coll<670Fn, | T, Red Op, Alg o, tProito, dCOL(L_UtNROiLL>d()), nthreads(n.trunh(tird,e suabtnd, wsork)), t; i | dI ^n /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cppBlock(threadIdx.x), g:7r:1: onote: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here u 7p | DE(FINgE_nrccloDevuFunp), | ^~~~~~~~~~~ c(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,( T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ oup(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | PrimitivesI, /d*x.x)Direct, grou=*/p(g0, Proto, 0> roup)p, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ r ims | ^ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hu:565:5: note: fin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565fSizes | [ runTreeUNpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tidPRO TO_SIMP, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here u303 | n Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:go, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_T note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereR 565 | E rEunTre_eUpDoSwn,lFuncAllReduce, COLFL_UNIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.hRuOLL>(ntidc, Prod:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , ntuhint32_t, NCCL_ArLeads,G wOor_k)TREE, N; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(CCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_< 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Fn, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hid(tid), :670:n15: warning: tinitializer order does not match the declaration order [-Wreorder-ctor] h670 | r tide(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ aid)d, nthsread(s(nnthretads),hreads), tidIn tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ (stepSBliock(tzhreaedIdx_.x) , g=roup=(gro up),0 | ^~~~~~~~~~~ ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/Block(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, su, RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().rubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] d670 | x tid.(tid),x nthrea)ds(nthr,eads), tgroup(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevrun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Func(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBT, RedOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) Runatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | 497 Ru | nWork Batch< coll, ty, redop, warpInB algol, prock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagToto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60hread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint:6 note: field 'group' will be initialized after field 'stepSize' 4670 | _ tidt(tid)), nt)hread s(nth{reads ), ti dInBl| ock ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | P(thrreadIidx.x), group(gromup),i | ^~~~~~~~~~~ tives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u3/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDeIn file included from vFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2W: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11o: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173r: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670k:15: Bwarning: initializer order does not match the declaration order [-Wreorder-ctor] a670 | t tid(ctid),h nthre, algo, protgrooup), , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ u671 | nstepSrize(stoell>().runpS(ize_ )== 0; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(LtE]/NChCL_STrEPS/siezeofa(T) d: steIpSized_) { x| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hx:63):56: note: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Pgrimitrives, r0, Prooto, u0> pripms | ^) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558,:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group:)670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor], 670 | tid(tid ), nthre| ads(nthr ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~eads), t idInBloc k(threadI| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_dx.x) 671 | , gr oup/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepS(groiup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~z | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ e_ == 0 671 | ? ncclShmem.comm.buffSizes[ sNtepSize(CstepSizeC_ == 0 L? ncc_PROTO_SlIShmem.comMPLE]/NCCL_Sm.buffSiTzes[NCEPCL_PROTO_SIMPLE]/NCCL_STS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heremme tric< 1, NCCL_MA565X_DEV | _A RI ruTY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreenTreeUpDown,RedO p, PCrotoSOimpleL<1, 1L, C_UNROLL>(tiOdLL_UN,ROLL> , COLnthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | iL_UNfROLL> (tid,( nthretads, iwork)d; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h<:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here s432 | ubtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduc>(e).run,(tid, subtFn, wourk); n | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cppc:P17:1: note: rin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here o17 | DEdFINE_n,cclD evFuunc(AlilRnt32_t, NCCL_ALGO_TREE, NCCLe_ducPe_TRREEO_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ TO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru:nWo670rkB:at15ch<:col l, note: ty,field 'nthreads' will be initialized after field 'tidInBlock' re dop , a670lgo | , p rot o, unr oll>t().iru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primind();( \ t | ^i /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hd:)670:15,: note: field 'nthreads' will be initialized after field 'tidInBlock' n670 | t h tird(teid)a, ndthrsead(nthreads), tidInBlotives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nth r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:e670:60a: note: dfield 'group' will be initialized after field 'stepSize' s670 | ) , ti d(titd),i ntdhreIadsn(ntBhrelads)o, tcidIknBl(ockt(thhreadreadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, //builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunW:o670:15:r warning: initializer order does not match the declaration order [-Wreorder-ctor] k670 | C tido(tid)l, nthlreads<(nthreFads),n tidI,nBloc k(T, RedOp, Algo, PthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCroto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_L_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ruSIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gnTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid r().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] Proto, 0> 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_L_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s t670 | e tidpSize(stepSiz(tied), n_thread s(nth=re= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().ruads), tindInBlo(ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(nAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadsthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UN/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grun(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf670f | tSid(tidi), nzthreades(nthrseads), tidInBlock(th[readIdNx.x), Cgroup(gCroL_PROTO_SIMPLE]/NCupC), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ _ 671 | S TEPS/siz stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLEeof(T)] : ste/pSize_N) { C | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupC /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hL_STEPS/:303:90:s note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here i303 | z Priemitiveso, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:f (T) : note: stepSiin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hereze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: 565note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | | Primi tives< T, RedO p, FanrAsymmeutric<1n, NCTreeUpCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollr, COoLL_UtNROLoL>(t,id, nthrCeadOs, woLrk);L _| ^ U/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hN:R432:78: Onote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here L432 | if (tid < subtLn>().)run( tid,R subutn, nworkW); o| ^rkColl, 0, 2, 4>::run' requested here p, Algo, Proto, COLL_UNROLL>().run( 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE,ti d, NsubCtn, CworLk); | ^_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cppPR:17:O1: note: Tin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here O_ 17S | DEIFINME_nPcclDLevFEun,c(A llR4edu)ce _TR EE| _SIMPLE_Prod_u32_4, ncclFunc^A /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hl:611l:62:Reduce, FuncProd, uint32_t, NCCL_ALG note: expanded from macro 'DEFINE_ncclDevFunc' O611_ | T RRunEWorEkBa,tch R, aOlgoT, O_SIMPLE, 4) | ^proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid( /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | , NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ hmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PRO:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch nthre,ads(n threadas), tildInBlogco, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here , 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] (AllR 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProdrunRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611Batch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl<:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)Fn, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid( 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorem.comm.buffkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFuncSize(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ s[NCCL_PROTO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ _SIMPL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: | warning: unused variable 'w' [-Wunused-variable] 75 | b arrier_by_g roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15u: note: expanded from macro 'barrier_by_group' 29 | i const innt w = threatdIdx.x32_t data1/WARP_,SIZE; flag1, data2, flag\ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from eadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uin75t:7: warning: unused variable 'w' [-Wunused-variable]3 75 | 2 ba_rrier_tby_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1,15: note: expanded from macro 'barrier_by_group' 29 | const int w = thre flag1, data2, flag2; | ^~~~~ adIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15:In file included from warning: unused variable 'bid' [-Wunused-variable] 366 | const int /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cppb:id = nc2clShm: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:em.channelId - work->channelLIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIo; d| ^~~ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid In file included from = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ , data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*DireSct=*/0, iProto, 0>z prims e | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hs:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here [ NCCL_PROTO_SIMPLE]/NCCL_STEP565 | runTreeUpDown, COLL_UNROLL>(tid, ntS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDhreaods, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunwn, COLL_UfNSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EE_SIUMPLE_NProd_uR8_2, ncOclFuncALllReduceL, FuncP>rod, ui(nt8_t, )NCCL_A.LGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, alrun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t,g o, proNto, unrColl>().Crun(); L\ | ^ _/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:A15: note: field 'nthreads' will be initialized after field 'tidInBlock' L670 | G tid(tiOd), nt_hreaTREE, NCCL_PROTO_SIMPds(LnthreEads),, tidI nBlock(threadId2x.x),) grou p(gro u| ^ p), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | In file included from tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_ize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDownds(nth,reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1:STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, algo, proto, unroll>().run(); \ | ^ RITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadsOLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCC), tLidI_nBlAocLk(tGhreOadI_dx.Tx),R EgroEup(,gro up)N, C| ^~~~~~~~~~~ CL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ (tid%4)= =3), group(group)671, | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 | 499 | stepSize( ncclShmem .comm.buf fSizes[NCCsL_PROTO_LtL128]/NCCeL_STEPpSize(stepSize_ == 0 S/si?zeof(uint6 4_t))ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL E{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ]| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/:461:N9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here C 461 | C Lprims(t_id, ntShreaTdsSpliEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | PrimitivesLdown, _&treeM->up, Awork->Xsendbuf_f, woDrk->rEecvbufVf, work->redOpArg,_ 0*ProtAo::MaxRGroupWIidth);T | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hY:1070:>5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here ,1070 /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | | runTreeSplit(tid, nthr runTreeUpDown, COLL_UNROLL>(tid, nthreads, ewads, oworkr); k| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h);:432:78 : note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432| | ^ i f (t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hid < subtn) RunW:orkC432oll<:Fn, 78T, R:edOp , Alnote: go, in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested hereProt o, C OLL_UNROL432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, duNce_TRCEE_LCL128L_Prod__u8_A2, nLcclFGuncAOllRe_duce,T FRuncPErod, Euint,8_t, NCCNL_ALCGO_TRCEE, LNCCL__PROPTO_LRL128O, 2)T /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr | ^O /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h_:611:S62: Inote: expanded from macro 'DEFINE_ncclDevFunc'M 611P | L RunEWork,Batc h, algo, proto, unroll>(). | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,r un(a); \l | ^g o, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' oup(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.:x670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]) ,670 | tgid(trid),o nthureadsp(nth(readgs), rtidIonBlouck(tphreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize),_ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h=:=670:60 : note: 0field 'group' will be initialized after field 'stepSize' 670 | ? tidn(ticd),c ntlhreSahmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sids(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI*/0, dProto,x 0> pri.ms | ^x /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:)565:5: ,note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | rugnTreeUprDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) Ruup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : nsWorkCtoll().r un(t{id, s ubtn, work| ); ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, RedOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDownALGO_,TREE , NCCCL_OPROTLO_SIMLPLE,_ 2) U | ^ N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:R611:62:O note: expanded from macro 'DEFINE_ncclDevFunc' L 611 | L > Run(WorkBtatchi,t algho, prroto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadesads,( wornk); t | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hh:r432:78: enote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here a432 | d s if )(ti,d < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFu tindIncBlo(ck(AthrleadlIdxR.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here ock(threadI12dx.x) | , groDup(EgroupF), | I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_N 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | E_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h, p:roto432, un:roll78>().:run( ); \note: | ^in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 432670 | | ti d(ti d), nthr eads( nth readsi), tfidIn Bloc(k(thtreadIidx.xd), grou().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd,Block(threadIdx.x), group(group), | ^~~~~~~~~~~ uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groutphreads(, work); g| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:r432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here o432 | iuf (tid < psubtn) Ru)nWorkColl,().r un(tid, s ubtn,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize work);( | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpps:12:1: note: tepSize_ == 0 ? nccin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Al/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prim:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ lShmem.comm.671 | stepSizebuffSizes[(NCCLs_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:l56Reds | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce:_RING_SIMPLE_Prod _u8_2,note: ncclFin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereuncAll Reduce , FuncProd, u63int8_ | t, NCC L_ALGO _RING , NCCL _PROTOP_SIMPLrE, 2imitives, 0) | ^, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611 :62: note: Pexpanded from macro 'DEFINE_ncclDevFunc' 611 | r RuonWorkBattch prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:l, ty, redop, algo, proto, unrtepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitivesr().riun();c \ <| ^ 1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h,:670: 15: note: Nfield 'nthreads' will be initialized after field 'tidInBlock' C670 | C tLid(t_id),M nthAreadXs(nth_readDs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | E V_AtRITY>i, d/*Di(rect=t*/0,i Prodto, 0)> pr,ims | ^ n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ht:565:h5: note: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565e | a rundTreesUpDo(wn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | ock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | :tid670(ti:d),15 nt:hre adwarning: s(ninitializer order does not match the declaration order [-Wreorder-ctor]thr ead s), tidI670nB | loc k(t hrea dId x.xt), igrodup((grotup),i | d ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h):670,:60: note: field 'group' will be initialized after field 'stepSize' n 670t | h trideads(nthreads), tidInBlock(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROL tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | L>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevF stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, algo, proto, MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou p ) , i f (| ti ^~~~~~~~~~~~~~~~~d /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h< subtn:) 670Ru:nW60o:rk Conote: field 'group' will be initialized after field 'stepSize'l l< Fn, T670, | Re d O p, Atlgio,d P(rottoi, dCO)LL,_U NRnOLtL>h()r.erauds(nthreads), tidInBlock(thn(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2readIdx.x), group(group), | ^~~~~~~~~~~ , ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ metric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hSTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().:670:r15: warning: initializer order does not match the declaration order [-Wreorder-ctor]u n670 | ( tid(ttid),i nthrdeads(n,threa ds), stidInuBlockb(threatdIdx.nx), g,roup( growup), o| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | r tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671k | )stepS;ize(st epSi | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DzEe_ =F= 0 I? ncNclShmEem.c_omm.nbuffcSizecs[NClCL_PRDOTO_eSIMPvLEFunc(]/NCCL_STEPAllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^S/sizeof(T ) : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hstepSize_) { :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~611 | group(group: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h62:254::90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here note: 254 | expanded from macro 'DEFINE_ncclDevFunc' Pr imitives611k, /*BDireact=t*/0,c Proht, ao,l 0>g proims , | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hp:565r:5: onote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here t 565o | , un runTreroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxeUp.Dowxn), C,OLL _UN ROLL>(| tid, nt ^~~~~~~~~~~~~~~~~hre ads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, work:); 670| ^: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h60::432: 78:note: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herefield 'group' will be initialized after field 'stepSize' 432 | i670f ( | tid < sub tn) tid(tid), nthreads(nthreads), tidInBlock(threRunWorkColl()adIdx.x), group(group), | ^~~~~~~~~~~ .run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_In file included from TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :11: | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: ^175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h :498:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] :496 | 670 : ti15d(:ti d),note: ntfield 'nthreads' will be initialized after field 'tidInBlock'hre ads (nthrea670ds) | , tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ %4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threa< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u8_2, ncclFuncAllReduce, FduIdx.xn), grocup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | PrimiProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem 611 | RunWorkBatch, algo, proto, unroll>().run().;comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x) \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hzeof(T): : stepS558ize_) {: | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h::63: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWor, 0,k Proto,C 0> prioms | ^ l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5l: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558< | ruFnRingL(tid_, ntUhreaNds, RworkO); L| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hL:>432:78:( note: ).run(tid, subtn, worin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ruon, CWOLL_oUNROrLL>(k).run(tiBd, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLEatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.idx), n)thre,ads(n thregads)r, tiodInBulopck(th(readgIdx.rx), ogrouup(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizesup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NstepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(ize_t) { | i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90, nthreads, work:) note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ; 303 | Pr| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:imi78tives<:T, Re note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested heredOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ 432 | if (tid < subtn) RunWorkColl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().]run(t/id, suNbtn, Cwork)C; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NLC_STECPS/siLzeof(_T) : sAtepSiLze_) G{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h_TR:63:56: Enote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here E63 | ,Primi tives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n>t(tidh, n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ thrreaeds, waork); d s(nthr| ^e /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ha:432:78:d note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here s432 | ) i,f (ti d < stubtn)i RundInBlock(threWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLEIn file included from , 2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp): 2 : | In file included from ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h175:: 611/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:62::498 :note: 29expanded from macro 'DEFINE_ncclDevFunc': warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 611 | RunWorkBatc h496< | c o l l ,t itdy(,t irde)d,o pne,a dasl(gnot,h rperaodtos,) ,u nwriodl(lt>i()d%.WrAuRnP(_);S I\Z E )| , ^ warp(tid/WA/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hR:P670_:S15I: Znote: Efield 'nthreads' will be initialized after field 'tidInBlock') , | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 670 | t497i | d ( t i dw)a,r pntInhBrleoacdsk((tnhtrheraeaddIs), dtxi.dxI/nWBAlRoPc_kS(ItZhEr)e,a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) warp(tid/WARP_SIZE, group (498g | r o u p )f,l a g| T ^~~~~~~~~~~~~~~~~h read/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(:(670t:i60d:% 4note: )field 'group' will be initialized after field 'stepSize'= =3), gr o670u | p ( g r otuipd)(,t i d| ) ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~, n| t warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3h reads(nt h499r | e a d s )s, tetpiSdIizneBl(oncckc(ltShhrmeeamd.Icdoxm.mx.)b,u gffrSoiupze(sg[rNoCupCL)_,P R | O ^~~~~~~~~~~T O_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRedIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stuce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeU/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: unused variable 'w' [-Wunused-variable] warning: 75unused variable 'data2' [-Wunused-variable] | ba rrier_by_145grou | p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :29 :15: note: expanded from macro 'barrier_by_group' uint32 _ 29 | t c data1, flagonst int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: In file included from warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:297: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | c:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ onst int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hhannelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpph:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:m27:15: warning: eunused variable 'bid' [-Wunused-variable] m27 | .channelId - cons t intw biod = rncclSkhmem.c-hanne>lId -c workh->chaannelLo; | ^~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nnelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channe:218:15: warning: unused variable 'bid' [-Wunused-variable] lId - 218work- | >chan ne const int bid = ncclShmem.chalLo; | ^~~ nnelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZEIn file included from ; \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] | 145 | uint32_ ^ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = th uint32_t data1, flag1, data2, flag2; | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h75:145:28:: warning: unused variable 'data2' [-Wunused-variable] 7145 | :uint32 _t datwarning: a1, flunused variable 'w' [-Wunused-variable]ag1, d ata2, flag2; | ^~~~~ 75/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: | warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3readIdx.x/WARP_SIZE; \ | ^ 2 _t dat a1, fl barriag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | In file included from const int bid = ncclShmem.chaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nnel/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bidI d - wo=rk->chan nelLo; n | ^~~ cclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdxIn file included from .x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:z670:15: warning: einitializer order does not match the declaration order [-Wreorder-ctor] 670 | o tid(ftid), nt(hreads(nuthreads)i, tidInnt64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hBlo:ck(thr461eadI:dx.x9), gr:ou p(grounote: p), in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step461Size | pr(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | TO_ SIM PLE]/ NCCL_ STErPS/siuzeof(nT) TreeSplit, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here L128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, 565s | u runTbreeUptDownn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) L L_UNROLL| >()^.ru n(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.htid, s:ubt611n, :wor62k);: | ^ note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:expanded from macro 'DEFINE_ncclDevFunc'7 :1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 6117 | D | EFI NE_n ccl Dev FunRcu(AlnlWoRerdukceB_TaREtE_cSIhMP,L aGlgOo,_ pTroRtoEE, NCCL_PROTO_S, unroll>().run(); \ | ^ IMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(t15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeoid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(f(T) : gstepSirze_) { o | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupu /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hp), | : ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepS 254i | z Preimitiv(es, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | h memr.comm.ubuffSizneTreeUs[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, wpDowon, 1, COrLL_UNROeLL>, CcOLL_UNvROLL>(tbid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollpredOp,Arg, 0*PrAotlo::MgaxGroo, Proto, COLL_UNROLL>upWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432().run | (tid, subtn, wor k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7: 1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_nc clDeivFunfc(All Re(duce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPtid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ LE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPIn file included from S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Pri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cppmitives<:T, RedO2p, FanA: symmetIn file included from ric<1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] NC 496 | tid(CL_MAX_DEtV_ARITYi>, /*Ddirect=)*/0, P,roto, 0> prinms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hthreads(nthrea:565:5:d note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUps), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInDownt, COhLL_UNRrOLeadIdxL>(ti.d,x /WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ nt hreads| , work); warp(tid/WARP_SIZE | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h 498 | flagThread((tid:432:%78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here4 432) | = i=f (t3id < )s, group(grouubtpn) R)unWo,rkCo l| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | l2().r8un(ti]d, s/ubtn,N worCk); C | ^ L/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | : 7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEF INE _nccplDevFrunc(iAllRmeducse_TR(EE_SItMPLEi_Sumd_bf8_-2, ncclFunncAlltRhreadsSplit, nthreads-netduceh, FunrcSume, rcacl_bdfloast8, SNCplit, &tree-CL_ALGO_TREE, NCCL_PROTO_SIMPLE>up, tree->down, work->sendbuff, work->recvbuff, | ^, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h::6111070:62: :note: expanded from macro 'DEFINE_ncclDevFunc' 5611 | : Ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] nWorkB atchnote: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ runTreeSpli, aRlgo,e prodtOp, ProtoLL12o,8 un,roll >().rCun()OLL_UNR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7OLL>(tid, nthr; | DEFI\NE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLEe | a ^ d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15s: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | , In file included from wortid, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k();/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | tid), nthreads(nthreads), tidInBlock: (In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:t11h: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:r173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670e:15: awarning: initializer order does not match the declaration order [-Wreorder-ctor] d670 | I dtid(txid),. nthxreads(nth r if (etid a< sdubtn)s Run)Work,Coll ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp, : | 5 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_: 1 :671 | note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested herest ep Size5(s | teDpSEizFe_I =N= E0 ?_ nnccclSchmlemD.ceomvm.FbufufnSizces([NACClL_lPRROTeO_dSIuMPcLEe]/_NCTCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | R group(groupEE /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h_LL128_Su:m_254bf:8_290, :nc cnote: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here l254F | un c AllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, N C PCriLmi_tivPesR62, :/* Dinote: reexpanded from macro 'DEFINE_ncclDevFunc'ct =* /0,611 P | ro to , 0> prRimus n | W ^ o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hr:565k:5B: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereatch, algo, proto, unro 565 | runTreeUpDown, COllL>L()._ruUNnR()O; L\ L | > ^ (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatckBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) R | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,em.comm.buffSizes[ NalgCo, proCto, uLnroll_>().rPun();R \ | O ^ T/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' O670 | _ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tiSIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Fn, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIn file included from InBlock(threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cppdx.x:),2 gro: up(In file included from group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h), : | ^~~~~~~~~~~~~~~~~ 11/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: :670:60In file included from : note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf8_2, ncclFtid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf8_2, uncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ncclFu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:ncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:670::15: warning: initializer order does not match the declaration order [-Wreorder-ctor]565 670: | ti5d(tid):, n note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | rthreuads(nthrneads)TreeUpDown, COLL_UNROLL>(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here cclShmem432.c:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o | mm. bu f fSi if (tid < subtn) RunWorkColl().run(tid, | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hs:254:90:u note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here b254 | tn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp Primitives<:T, Re17dO:1p,: note: FanAsymmetric, /in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllR*Direect=*/d0, Puroto,c 0> perims, | ^ FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hO:565:5:T note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here O565 | _ runTSreeUpIDownnote: ,expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchl(tid,, nthr eads,t work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hif:670: 15:(t warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ny,t redophr, algoe, proato,d unrosll>().(run()n; \ | t ^ h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15r: note: field 'nthreads' will be initialized after field 'tidInBlock' e670 | a tdiid () .run(| tid, ^~~~~~~~~~~~~~~~~ sub tn, w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hor:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), k)n; t| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllRedhreads(nthreads), tidInBlock(threadIdx.x), group(group), u ce, | Fun ^~~~~~~~~~~cSu m, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, 670 | t /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTrid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : seeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bflometric<1, ().Nrun(tiCd, subCtn, worLk); | _ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cppM:17:1: Anote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here X17 | D_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^EF INE_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | nr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hunTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RRedeuce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCLdO_p, PrPotoSRimOple, CMOLL_PUN:R670:15:O warning: initializer order does not match the declaration order [-Wreorder-ctor] L670 | L t>(tid, nthreads, wido(tidr), nkthre)ads(;nthr eLE a, 4| d) ^s) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h, | t^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:i:611432d:62::78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereI nB lock(t432 note: h | expanded from macro 'DEFINE_ncclDevFunc' 611r | e a d R IunidWofxrk .Ba(xtct)h<,icod < subtn) RunWorkCollgroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stO , p?al, go n, Acprlcotglo,oS u,hnrm olePl>mr().o.rctunoo()m,; \ m C.| ^Ob /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hLu:670:15: note: ffield 'nthreads' will be initialized after field 'tidInBlock' f670S | i tzid(etsid)[, nNthLCr_UCeNRLaOLd_L>sP(()R.nrOutnT(htOrid_e, SsIMPLE]a/ds), tidInBlubton, cworkk);( | t ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpphNCrCL:e_Sa17TEd:PS/1siz:eI dxnote: .oin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduf(Tc) : stepSeize__) {T | R ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ E| group(group E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h_:303:S90: note: Iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here M303 | P Lx)PE, r_griSoumup(imgrt_ouibp)vf, 8e| ^~~~~~~~~~~~~~~~~_s /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h4<:,T670: ,60:n note: cRfield 'group' will be initialized after field 'stepSize' ce670dl | OF pu t,nid c(tFAida)l,n lnAtRhsreeyaddmus(mncteherte,r aidFsc)u,< nt1icd,ISnB luNocmCk(,Cth Lrer_adIcMdxcA.xlX),_ _gbDrofulEVoat8, NCCL_ALGO_TREE, NCCL_PROTO__ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:p565(gr:oup5), :| ^~~~~~~~~~~ note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreSIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchtoS,imp le<1a, 1l, CgOoLL_U,NRO LLproto, unroll>().run>(, C)OLL;_UN ROL\L>( tid , n| thr ^ead s, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hwo:670:15r:k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hnote: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tid:432:I78: nnote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here B 432 | l o cif k(tid( < tsubhtn)r RuenWoarkCodll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, Fundx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < su/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comRedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfl, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670oat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives,r /*Direect=*/0, aProto, 0d> prims s | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h(:565:5: note: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565t | ruhreads),nTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | tidIn ^Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, , Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSi z tid(teid), _ ==nthreads (nthread0s), tidI nBlock(?threadId x.x), gronucpclShmem.comm.(groubp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ff 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitivesds), tid,InBlock( threadIdx/.x), g*roup(groDup), | i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ r671 | (Te ) : stecpsSize_) tt{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ =e| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h*p:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereS/ 303 | 0i Priz,mitivese, prim_ == 0 ? ncclShme FanmAsym.metrcic<1o, NCCmL_MAmX_DEV._s bA u| ^ f/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:Rf565:5IS: note: Tiin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here zY565 | e> s,[NCCL_ /*Direct=*/0, Proto, 0runT>reeU pDowpn, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTrOLLe_UNReOLL>U, COLpL_UNDROLLo>(twnid, nthreads ,: s tepSwize_)o { r | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ k| group(group) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h; :63:56 : note: | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 ^ | Pr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.him, 0, 2, 4>::run' requested here 432 | if, R edOp,( FantSymmietrd < subtn) Ruic, W0, Porotor, 0CkOL>LC_U NRopOLLl>r, lCiOL(, tid| , n ^Tthr ,ead s,R woerk);/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hd | O ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | :432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | r p, u Algoni, PRfroit o,n( COtLL_UiNRgOdLL>< ()T, R< subtn) R.runun(tWid, osedOrkColl, 0, 2, 4>::run' requested here,C P roO toL, COLL17L__U | NUROLLND>(tREid, OFnthLIreaLNds,>E wo_(rk);n) c. | ^ c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hlr:432:Du78: ennote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herevFunc(All R432 | e duce_TREE_SIMPL iE_Sum_bf8_4, ncclFuncAllfR (etid d< usubtcn) RuenWor,kCol l().ru:n17:1: (note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here t17 | DEFINE_ncclDevFiuncd(All,Redu ce_TsREE_uSIMPbLE_Stumn_bf8,_4, ncclwFuncoAllRerdCuCkL_cA)LGeO;_T,R EE ,F NCCuL_PR nOTO| c_SI ^SMPLEu, 4 m, rccl_bfloat8, NCCL_ALGO_TRE) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cppN:22C:1C: Lnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here_ P22R | DEOFITNEO_n_ccSlDIevMFuPncL(AEllR,ed uc4e_)RI NG _S| IMPLE_Sum_bf8_^4 ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :n611:62:cclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING ,note: expanded from macro 'DEFINE_ncclDevFunc' N611 | C C L R_unPWoRrkOBaTtcOh<_coSllI, MtyP, LreEdo,p< ty4>,) a lg o| , ^pr /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hoto, :un611ro:ll62>(:). runote: nexpanded from macro 'DEFINE_ncclDevFunc'() ; \611 | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h : 670:15R:u note: field 'nthreads' will be initialized after field 'tidInBlock'n Wor670 | k B taid(tid), nthreads(ntcth,d aIlgno,B plroock(threadIdx.x), grotuo,p u(ngrorllo>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' up ), 670 | | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 670: 60: tid(tid), nthreads(nthreads), tidInBlock(threanote: dfield 'group' will be initialized after field 'stepSize' I d670 | x . x t)id,(t idgr)o,u pnt(hrgeardso(nuthpre)ad,s) , t| id ^~~~~~~~~~~~~~~~~In Bl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hock(:thre670ad:Id60x.:x) , note: grfield 'group' will be initialized after field 'stepSize'ou p (gro670up | ), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hhreadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hannelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.heduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.he:a670d:s15(:n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads), ti d670I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I n B| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_o ck(threadIdx.x )671, | g r o uspt(egprSoiuzpe)(,s t| e ^~~~~~~~~~~~~~~~~p Size/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h_: 670=:60=: 0note: field 'group' will be initialized after field 'stepSize'? ncclSh m670e | m . co m mt.ibdu(ftfiSdi)z,e sn[tNhCrCeLa_dPsR(OnTtOh_rSeIaMdPsL)E,] /tNiCdCILn_BSlToEcPSk/(stihzreeoafd(ITd)x .:x )s,t gerpoSuipz(eg_r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^~~~~~~~~~~ group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herenroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(In file included from threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouhreapds, wo)rk); , | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :432: 78: | note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~432 | if (ti| d < s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ubtn) RunW orkColl().rpuSnize(stepSize_ == 0 ? nc(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, Lalgo,L pro>to,( untrolli>().rdun(),; \ | ^ nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nIn file included from cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives0, /*D irect?=*/0, Proton, 0> cprimcslShm | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DE F| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558I:5: note: Nin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here E558 | _runRinng(Ftid, unthrenads, cwork(A); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(,).ru n(titd, subytn, w,ork); | ^r /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cppe:12d:1: note: oin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here p 12 | , algFo, pruoto, nunrollc>().rSun();u \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'm, half, NCCL_ALGO_670RING | , NCCL_P ROTO_ SIMP LE, 2) | ^ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hi:611:62:d note: expanded from macro 'DEFINE_ncclDevFunc' 611 | (RutnWoirkBatdc)h,, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ic, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 1717 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1id = ncclShmem.channelId - work-,> flag1, dcata2hannelLo; | ^~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: :5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ d - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hp(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> primsFanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Simple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siIn file included from zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollm(e)t.rriucn<(1t,i dN,C CsLu_bMtAnX,_ DwEoVr_kA)R;I T Y| > ^, /*Direc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cppt:=5*:/10:, note: Pin instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested herer oto, 0> pri m5s | D EF| I ^N E_ncclDevFunc(AllRed/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hu:c565e:_5T:R Enote: Ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here_ LL128_Sum_ f5656 | 4 _ 2 , rnucncTlrFeuenUcpADlolwRneC,C LC_OPLRLO_UTNOR_OLLLL1>2(8t,i d2,) n t| h^r eads, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hw:o611r:k62):; note: expanded from macro 'DEFINE_ncclDevFunc'| ^ 611 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :R432u:n78W:o rnote: kin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested hereB atchi,d a< lsguob,t pnr)o tRou,n WuonrrkoCloll>l(<)F.nr,u nT(,) ;R e\d O p| , ^ Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCo17 warnings generated when compiling for host. ll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1:: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NG_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFunc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | | ^ Primitives, 0, 670P | r o t o ,t i0d>( tpirdi)m,s n t| h ^r eads(n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.ht:h558r:e5a:d snote: )in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here, tidInB l558o | c k ( t hrruenaRdiIndgx<.Tx,) ,R egdrOopu,p (Pgrrootuop,) ,C O L| L ^~~~~~~~~~~~~~~~~_ UNR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hO:L670L:>60(:t inote: dfield 'group' will be initialized after field 'stepSize', nthrea d670s | , w o rtki)d(tid), nthreads(nth;r e a| d ^s ), tidInBl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ho:c432k:(78t:h rnote: ein instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herea dIdx.x), g r432o | u p ( g r o uipf) ,( t i| d ^~~~~~~~~~~ < subtn) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, u670n | r o l l >t(i)d.(rtuind()),; n\t h r| e ^a ds(nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h):,670 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock(thre a670d | I d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_e ads), tidI n671B | l o c k (sttherpeSaidzIed(xs.txe)p,S igzreo_u p=(=g ro0u p?) ,nc c l| S ^~~~~~~~~~~~~~~~~h mem./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hc:o670m:m60.b:u fnote: ffield 'group' will be initialized after field 'stepSize'S izes[NC C670L | _ P RO T Ot_iSdI(MtPiLdE)],/ NnCtChLr_eSaTdEsP(Sn/tshirzeeaodfs()T,) t:i dsItneBplSoiczke(_t)h r{e a d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d x .| x group(group) , group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().etric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt, group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PR:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subOtnT)O _RSuInMWPoLrEk]C/oNlClC| ( group(group) .run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp63::1756::1: note: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested herein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 1763 | | D E F I NPEr_inmcictliDveevsF_,S u0m,_ fP6r4o_t4o,, n0c>c lpFruinmcsA l l| R ^e duce, F/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hu:n558c:S5u:m ,note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hered ouble, N558C | C L _ A LrGuOn_RTiRnEgE<,T ,N CRCeLd_OPpR,O TPOr_oStIoM,P LCEO,L L4_)U N R| O^L L>(tid,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :n611t:h62r:e anote: dexpanded from macro 'DEFINE_ncclDevFunc's , work); 611 | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx942. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp 21 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: 14In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cppIn file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrierIn file included from _by_gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.houp(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14In file included from : warning: unused variable 'data1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5 uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from 27 | const int bid = ncclShmem.channel/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Id - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->chaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nnelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670rimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, u tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); o| u ^p (group),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp : 17| : ^~~~~~~~~~~1 : note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hup:(670g:r15o:u pwarning: ),initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | 670 | t i dt(itdi(dt)i,d )n,t hnrtehreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RI:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NG, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlocIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl ( ) .triudn((ttiidd),, snutbhtrne,a dwso(rnkt)h;r e a| d ^s ), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cppB:l12o:c1k:( tnote: hin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herer eadIdx.x), 12g | rDoEuFpI(NgEr_onucpc)l,D e v| F ^~~~~~~~~~~u nc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), t565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hmm:.670b:u15f:f Swarning: iinitializer order does not match the declaration order [-Wreorder-ctor]z es[NCCL_PROTO_SIM PL670E | ] / N CtCiLd_(tSiTd)E,P Sn/tshirzeeoafd(s(Tn)t h:r esatdesp)S,i zet_i)dI n{B l | o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c k( t| h group(groupr eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | : tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_254 :90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 671 | 254s | t e p S i z eP(rsitmeiptSiivzees_< T=,= R0ed O?p ,n cFcalnSAhsmyemmm.ectormimc.S,I M/P*LDEi]r/eNcCtC=L*_/S0T,E PPSr/ostioz,e o0f>( Tp)r i:m ss t| e ^p Size_) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h{ : 565| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~5 : | note: group(groupin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTree/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hU:p303D:o90w:n , FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here, RedOp, P r303o | t o S i m p lPer ,F aCOnLAsLy_mUmNeRtOrLiLc>( , | / ^* Direct=*/0/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h,: 432P:r78o:t onote: ,in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 0> prims | ^ 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h : 565 : 5i:f note: (in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heret id < sub t565n | ) R u nrWuonrTkrCeoelUlpN(R)O.LrLun>(,t iCdO,L Ls_uUbNtRnO,L Lw>o(rtki)d;, n| t ^h reads, wor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cppk:)17;: 1 :| ^note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: 43217: | 78D:E Fnote: Iin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereN E_ncclDev F432u | n c ( A l l Riefd u(ctei_d T,( )N.CrCuLn_(AtLiGdO,_ TsRuEbEt,n ,N CwCoLr_kP)R;O T O| _ ^ SIMPLE, 4/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp): 17 :| 1^: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'17 | DEFINE_ n611c | c l De vRuFnunWco(rAklBlaRtecdhuu,3 2al_g4o, ,n cpcrloFtuo,n cuAnlrloRleld>u(c)e.,r uFnu(n)c;S u\m , | u ^i nt32_t, NC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hC:L670_:A15L:G Onote: _field 'nthreads' will be initialized after field 'tidInBlock'T REE, NCC L670_ | P R O T Ot_iSdI(tMiPdL)E,, n4t)h r e| a^d s(nthre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ha:d611s:)62,: tnote: iexpanded from macro 'DEFINE_ncclDevFunc'd InBlock (611 | t h r e aRduIndWxo.rxk)B,a tgcrohu:,670 :a60l:g onote: ,field 'group' will be initialized after field 'stepSize' proto, u670n | r o l l >t(i)d.(rtuind()),; n\t h r| e ^a ds(nthre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ha:d670s:)15,: tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd InBlock( t670h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), ti 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDowns), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from , COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_AdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) Ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const iIn file included from nt bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | ba/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int birdr i=e rn_cbcyl_Sghrmoeump.(c)h;a n n| e ^~~~~~~~~~~~~~~~~~l Id - work->/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hc:h29a:n15n:e lnote: Loexpanded from macro 'barrier_by_group'; | ^~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_A:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | :432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ LGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hR:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ ING_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hL:L5651:258:_ Snote: uin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested herem _u64_2, n565c | c l F u nrcuAnlTlrReeedUupcDeo,w nFR,O TCOO_LLLL_1U2N8R,O L2L)>( t i| d^, nthreads, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :| 611 ^: 62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432 :61178 | : note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested hereR unWorkBatch <432c | o l l , t yi,f r(etdiodp <u,b tanl)g oR,u npWroortkoC,o lulnT(,) .RreudnO(p),; A\l g o| , ^ Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, eadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllRe/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, a:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stelgo,p proto, unrolSl>().irun();z \ | e ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(:670s:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t tid(tid), nethreadps(nSthreadsi), tidIznBlocke(threa_dIdx.x ), gr=oup(gro=up )0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadsSIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | (nthreadsP), tirdInBliock(tmhreadiIdx.xt), groiup(grvoup), e | ^~~~~~~~~~~ s, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.htid(tid), nthreads(nthreads), tidInBlock(threadIdx.x> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | i),f gr oup(g(routp), i | ^~~~~~~~~~~ d < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFunc:A670:15: warning: linitializer order does not match the declaration order [-Wreorder-ctor] l670 | R tide(tid)d, nuthreadcs(nethreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch : s,tepSi ze_) { | a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hg:303:90o: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ,303 | Pprimitirves().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hFanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | pr ims | t ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565i:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hered 565 | ( rtunTireeUpdDown, CaOLL_dUNROsL(Ln>(tthirde,a dns), tidInBlock(thrthreaeds, waodrk)I; | dx.x ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (t), group(group), | ^~~~~~~~~~~ id < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | rutidn(tid), nthreads(ntRhirng), tidIn(Block(tthreadIdxi.x), groupd(, grnthreads, oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColls[NCCL()_P.run(tROTO_SIiMPLE]/NCdCL_STEPS/,sizeof(T) : stepSisze_) { ub| tn, work); ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group| /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h ^:63 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22::56: 1note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here : note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEF63 | Primitives, 0, PINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' roto, 0>611 prims | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :558 :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | RrunuRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h group(group:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (ti 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ onst int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366::218:1515: :warning: unused variable 'bid' [-Wunused-variable] 218warning: | unused variable 'bid' [-Wunused-variable] co nst in366t b | id = const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ncclShmem.channelId - work->ch/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ annelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ncclShmem.channelId - work->channelLo; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtnuffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, workCOLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, Func/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), Sum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadItidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSuize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); dOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hs(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ o 671u | p), s| tep ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S ize (st| epS tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ize _ == 0 ?671 ncc | lSh mem .comm. buf fSizses[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)te pSi:ze( stepsSizte_e ==p 0 ?S nciclSzhmeem.c_omm).bu ffS{ize s[ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N CCL _| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | PrPRiOTOm_SIiMPLtE]/NiCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, F0, aPrnotSo, y0m> pmrimes t| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hr:558i:c5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing<1<>,T, Re 0d,O Prpot, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | o, 0> p rim s i| ^f /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :558(:5: tnote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hid < sub:tn432) :Ru78nW:o rknote: Coin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herell ().run(tid, subtn, work); | ^ T, Re/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppdO:p,12 A:lgo, P1ro:to , note: COin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereLL _U NROL12L> | ()D.rEFINE_ncclDevFunc(AllReduceu_n(RtiIdN,G s_ubStnI, MwoPrkL);E _| ^ S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppu:m22:_1:u note: 8in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here _222 | D,EF INnE_cncccllDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, nFucnccAlllRFeduucne,c FAlunlcSRume, duiuntc8_et,, N CCFL_AuLnGO_cRISNGu, mNC,CL _PuROiTOn_StIM8PL_E,t 2,) | N^CCL_AL /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,G O_aRINlG,g NoCC,L_ PpRrOToO_tSIoMP,LE , u4)n r| ^o l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hl:611>:62(: )note: expanded from macro 'DEFINE_ncclDevFunc'. r611 | u n (R)unWork;Ba tc\h< co l| l, ^ t y,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h redop, algo, :p670:r15:o note: tfield 'nthreads' will be initialized after field 'tidInBlock' o , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(t670i | d ) t,id (tnidt), hntrhearedadss((ntnhrteahdsr),e taiddInBlock(threadIdx.x), group(grousp)),, t | ^~~~~~~~~~~~~~~~~ i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hdI:n670B:l60o:c knote: field 'group' will be initialized after field 'stepSize' ( th670r | e a d I dtxi.dx()t,i dg)r, ntohurpe(agdrso(ntuhrpe)a,d s )| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, :t670i:d60I:n note: Bfield 'group' will be initialized after field 'stepSize'l oc670k | ( th r e atdiIdd(xt.ixd/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDo)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~s ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ wn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLLS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, e_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Fn, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>S(ItMiPdL,E ,n t4h)r e a| d^s , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hnote: :expanded from macro 'DEFINE_ncclDevFunc'432 :78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 611 | 432 | R u n W o rikfB a(ttcihd< cl,l P(r)o.trou,n (C)O;L L\_ U N| R ^O LL>().run(tid, s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hu:b670t:n15,: wnote: ofield 'nthreads' will be initialized after field 'tidInBlock'r k); | ^ 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp :t17i:d1(:t inote: din instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here) , nthreads (17n | tDhErFeIaNdEs_)n,c ctliDdeIvnFBulnocc(kA(ltlhRreedaudcIed_xT.RxE)E,_ SgIrMoPuLpE(_gSruomu_pu)8,_ 4 ,| ^~~~~~~~~~~~~~~~~n cclF/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hu:n670c:A60l:l Rnote: efield 'group' will be initialized after field 'stepSize'd uce, Fu n670c | S u m , tuiidn(tt8i_dt),, NnCtCLh_rAeLaGdOs_(TnRtEhEr,e aNdCsC)L,_ PtRiOdTIOn_BSlIoMcPkL(Et,h r4e)a d I| d^x .x), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hg:r611o:u62p:( gnote: rexpanded from macro 'DEFINE_ncclDevFunc'o up), | ^~~~~~~~~~~611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 21 warnings generated when compiling for gfx942. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclSIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.chanIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flanelId - work->channelLo; | ^~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uIn file included from int32_t data1, flag1, data2, f/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppl:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: aIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:g75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 2barrier_;by_grou p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:| 15: note: expanded from macro 'barrier_by_group' ^~~~~29 | c onst int w = threadIdx.x/WARP_SIZE; \ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ data2, flag2; ^| ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, f/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hl:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:a173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75g2; | | ^~~~~ ba rrie/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hr_by_group():; | 145 ^~~~~~~~~~~~~~~~~~:21: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hwarning: :29unused variable 'flag1' [-Wunused-variable] 145:15: | note: expanded from macro 'barrier_by_group' uint32_t data1, f29 | const inlt w a= thrgead1,I dxdata2, flag2; | ^~~~~.x /WAR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hP_SIZE; \: | 145/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_ ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclt data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ Shmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:75:7: warning: unused variable 'w' [-Wunused-variable] : 75 | 29 barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uin/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - workIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> primsIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthr:670:15e: warning: initializer order does not match the declaration order [-Wreorder-ctor]a d670 | tsid(tid),( nthreadsn(nthreads)t, tidInBlochk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :reads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hOLL>(tid, nt:670h:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] r 670 | ea ds, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested heretid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(gIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hS:670:/15: warning: initializer order does not match the declaration order [-Wreorder-ctor] s670 | i tid(tidz), nthreeads(nthoreads)f, tidI(nBlocT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: k(threain instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heredIdx.x) , group (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 303| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | t Primoitives, COLL_U, RedOp,N FanSymRmetriOc<1>L, 0, PLT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(ubtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' threadIdx.x), group(group), | ^~~~~~~~~~~ 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' k(th 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2Size(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: : in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 63 tid( | tid), nthread s(nth reads), tidInPBlockr(threimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid,S izes[NnCCL_PRtOTO_hSIMPLE]r/NCCL_eSTEPS/asizeofds, work); | ^( T) : s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | tiepSifze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:(tid 63:56: , FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here subtn) Ru 63n | WPrimoitivers, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group( 558 | g rurnRing(tid , nthtreadsi, wordk); (| ^tid), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hn:432th:78reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().rtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, alg/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ho, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidI15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_SnBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: 17 warnings generated when compiling for host. note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx942. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t datIn file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ a1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = nccl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Shmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ threads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:s, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | In file included from const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(In file included from tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nIn file included from threads)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from 77 | u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: nIn file included from t32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cppadId:2x.: x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]In file included from 145/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11 | : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h :80:5 : warning: unused variable 'w' [-Wunused-variable] 80 | bar rier_by_group()u; | ^~~~~~~~~~~~~~~~~~ i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15:n note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/Wt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tidIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here s), tidI 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FnBlock(tuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ hreadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTIn file included from O_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cppS:I2M: PIn file included from L/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hE:]11/: NIn file included from C/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hC:L173_: STE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hP:S670/:s15i:z ewarning: ofinitializer order does not match the declaration order [-Wreorder-ctor]( T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hr:e254a:d90I:d xnote: .in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herex ), group(g r254o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P r i| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_i tives.,c o/m*mD.ibruefcftS=i*z/es0[,N CPCrLo_tPoR,O T0O>_ SpIrMiPmLsE ] /| N ^C CL_STEP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hS:/565s:i5z:e onote: fin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here( T) : step S565i | z e _ ) r{u n T| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e e U| p group(groupD own,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :C254O:L90L:_ Unote: Nin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereR OLL>(tid, n254t | h r e a d s ,P rwiomrikt)i;v e s| < ^T , RedOp, Fan/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hA:s432y:m78m:e tnote: rin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested herei ci,d /<* Dsiurbetcnt)= *R/u0n,W oPrrkoCtool,l <0F>n ,p rTi,m sR e d| O ^p , Algo,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h :P565r:o5t:o ,note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereC OLL_UNRO L565L | > ( ) . rruunn(Ttriede,U psDuobwtnn<,T ,w oRrekd)O;p , | P ^r otoSimple, 0, 2, 2>::run' requested hereL L_UNROLL>, CO L7L | _DUENFRIONLEL_>n(ctcildD,e vnFtuhnrce(aAdlsl,R ewdourcke)_;T R E| E ^_ SIMPLE_SumPos/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ht:D432i:v78_:u 3note: 2in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here_ 2, ncclFun c432A | l l R e d u cief, ( tFiudn c().run(tid, suexpanded from macro 'DEFINE_ncclDevFunc' 611 | btn, work); | ^ Ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7nW | DEFINE_ncorkclDevFunc(AllReduce_TREE_SBatchIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hp(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:217: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(t warnings generated when compiling for gfx906. hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS:670i:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] z670 | teid(tisd), nthre[ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == N0CCL_PROTO_ SIMPLE]/?NCCL _STEPS/snizeof(T)c : stcelShmpSize_)e {m.comm .| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ b| uffSizes[NCCL_PROTO group(group_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:S63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested hereIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> 558 | prunRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < suims | ^ b/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | n runTree)UpDow nRu, COLLN_UNROLL>().ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorrun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, ukColil(t).run(,tid, subtn ,N CwCorL_kALGO_RING, ); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671: | 670stepS:i15: warning: ze(stinitializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInepBSize_ =l= 0 ? ncoclShmecm.comkm.buffSizes[NCCL_(PROTO_StIMPLhEr]eadIdx.x), g/rNCCL_SToEPS/sizeup(groouf(T) :p )s, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | tepSi ze_) { s | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hepSize(s:303t:90: enote: pin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereSi z e_ == 0 ? 303 | n PricclShmem.commmitives., FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); NCCL_MA X_DEV_A| RITY>, /*Di ^rect=*/0, P roto, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h::565:5432: note: :78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | 432 | if (t irunTredeU p, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkColl, /*Direct=*/0r, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here oto, COL 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId:x670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] . 670 | xtid(tid),) nthrea,ds(nthr eads), gtidInBrlock(throeuapd(Igdroxup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: ROTOin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here_SIMPL E]/NCCL_STEPS/sizeof(T) : stepS ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here | Primitives, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h0, :P670:15: warning: rinitializer order does not match the declaration order [-Wreorder-ctor] 670 | o tid(ttid), onthreads(,nthreads ), 0tidInBlo>ck(th readIdpx.x),r group(grioms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tiepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(zsetoefp(STi) z:e _s t=e=p S0i z?e _n)c c{l S h| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e m .| c group(groupo mm.buffSizes[NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hM:P63L:E56]:/ Nnote: CCin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereL _STEPS/s i63z | e o f ( TP)r i:m isttievpeSsi, 0, Proto/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | C CiLf_ M(AtXi_dD En,W o/r*kDCiorlelcl gpor,i mPsr o t| o ^, COLL_UNROLL>().r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hu:n565(:t5i:d ,note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heres ubtn, work )565; | | ^ runTreeUpDown/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp<:T12,: 1R:e dnote: Oin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herep , ProtoSimp l12e | (,A lClORLeLd_uUcNeR_ORLILN>G(_tSiIdM,P LnEt_hSruemaPdoss,t Dwiovr_ku)3;2 _| 2 ^, ncclFuncAll/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hR:e432d:u78c:e ,note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereF uncSumPos t432D | i v , u i nitf3 2(_tti,d N().ru n611( | t i d , RsuunbWtonr,k Bwaotrckh)<;c o l| l ^, ty, redop17,: 1a:l gnote: oin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here, proto, unro l17l | >D(E)F.IrNuEn_(n)c;c l\D e v| F ^u nc(AllRedu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hc:e670_:T15R:E Enote: _field 'nthreads' will be initialized after field 'tidInBlock'S IMPLE_Su m670P | o s t D itv_iud3(2t_i4d,) ,n cnctlhFruenacdAsl(lnRtehdruecaed,s )F,u ntciSduImnPBolsotcDk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ iv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 558 | runRing(tid, nthread 0> psrims | ^ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIwork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWornBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk:670:60: Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_StIidInBloMck(threaPdIdx.x), Lgroup(grEoup), | _ ^~~~~~~~~~~ SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll:>670(:).15: run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 0 tid (tid)?, nthr eads(ntnhreads),c tidIncBlock(thlreadISdx.x), hgroup(grmoup), e| ^~~~~~~~~~~~~~~~~m /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670.:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIncomm.bufBlfock(thrSeadIdixz.exs[),N CCgrLo_uPpR(OgTroup), | ^~~~~~~~~~~ O_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62oup), | ^~~~~~~~~~~ : note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShthreadms),e tidmInBlock(threadIdx..x), groupc(group)o, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hm:670:60:m note: field 'group' will be initialized after field 'stepSize' 670 | . buffSiz tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Dir/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N < subtn) RunWorkColl().run(tid, subtn, work); | ^ L_STEPS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ), groutp(groupi), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ d| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ (671 | t stepSizie(stepSizde_ == 0 ?) ncclShme,m.com m.buffSizes[NCCL_nPROTOt_SIMPLhE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: readsin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here(nthreads ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : st 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | epSiz ^e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h Primitives, /*Direct:=*/0, 432:78:P note: roto,in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (ti0d < sub>tn) Ru nWporkColrl, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ot o, CO LL_UN ROLL>r().ruun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sub/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.htn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRe:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreaduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0u>ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.htid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCC:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(steLp_PROTO_SSIMPLE, 4i) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hz:611:62e: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru_n W=o=r k0B ?at cnhc, algoC, L_PROT/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hproto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tO_SIiMPLE]/dNCCL_STEP(S/sizetof(T) :i stepSized_) { | ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ , | group(groupn th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hr:e63ads(nthreads), :56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | :670:15 ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hid(tid), nthreads(nthrea:ds), ti558dInBlock:(thr5eadIdx:.x), g roup(note: group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 671 | step Size(stepSi558ze_ == | 0 ? ncc lShmem. cotmi dImnBlock.(threadb Idx.x)u,r groupuf(groupfn), | ^~~~~~~~~~~~~~~~~RS /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:ii670:n60z: note: field 'group' will be initialized after field 'stepSize' e670 | stid(tid[), nthreNads(nthCreads)CL_PROT, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ g(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:O_SIM78PLE]/NC:CL_STE PS/siznote: eof(T)in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here : s 432 | if (tid < subtn) RtepSuize_) n{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ W| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | o Prrimitivkes, 0U, ProNto, 0R> prims O | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hLL>().run(t:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h: const in11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.ht:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h w = :75:7: threadIdx.x/WARP_SIZE; \ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11145: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174 | : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145: 14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t da uta1, fintl32_t data1a, flag1, data2, flag2; | ^~~~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data2, flag2; | ^~~~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t datIn file included from a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: IIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: Z/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5E: warning: unused variable 'w' [-Wunused-variable] 80; | barrier_by\_gro up(); | ^~~~~~~~~~~~~~~~~~| /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h ^:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo;/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | const int bid = ncclShmem.channelId - work->channelL:366:o15: warning: ;unused variable 'bid' [-Wunused-variable] 366 | con | ^~~ st int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fla/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channunused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ elId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*DiIn file included from rect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threaIn file included from dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSiz/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, e_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runT/*Direrct=*/e0, Preoto, 0U> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5p: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Down, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlocnthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u6 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Pr4_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid)imitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | , nth reads(tnthreads), itidInBdlock(threadI(dx.x),t group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tidrkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | tid), n threads(nthrea ds), tidIn Block( threadIsdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:(173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670s:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]t 670 | tid(tid), enthrepads(ntShreadis), tidInBlozck(threetaepSize(dstepS_iIze_ = =d 0 =? nccl=Sxhmem.c om.m.0buffSixz es[N)CC?L_,PR OTO_SIMPL E]/NCgCL_STEPS/sizreofoup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | (T) : stepSize_ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90:tepSize(stepS note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitiv, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads,L_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78 wor:k); note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), 303nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558Op:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | , runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllR FanAsymmetric<1, NCCL_MA:670:15X: warning: initializer order does not match the declaration order [-Wreorder-ctor] _670 | tid(tid),D nthreadEseduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllRed(nuce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | t RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' hrV 670 | tid(tid), nthreads(nthreads),eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' _tidSize(stA 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSIize_ ==n 0R B? ncIcllShmeomT.comm.Y>, /*Direct=*/ck(threa0dIdx.x),, bug ffSizers[NCCL_PPoROTOu_roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | SIM PLE]/NCCLs_STEPS/stizeof(Te) pSize(stepSize_ == 0 ?: s tepSizen_) { | c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShm | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIem.ncommB.buffSlizeos[NCCcL_PkROTO_SIM(PLEt]/NCCL_SThEPSr/sizeofe(T) : staepSizde_Idx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), t | ^i /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15d: note: field 'nthreads' will be initialized after field 'tidInBlock' I670 | tind(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PR)O, nthTreads(nthreaOds), ti_dInBlocSk(threaIdIdx.x),M groupP(group)L, | ^~~~~~~~~~~ E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gic<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARPIn file included from _SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lag1, data/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ IZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclIn file included from Shmem.com/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tim.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthrea | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hds, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :558d), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread:5:( note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558( | runRitng4(tid, )nthre==3), group(gads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, LL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncnthrAeads-lnthreladsSpRlit, &etree-d>up, utree-c>downe, wor,k->se ndFuncSumPostDiv, uinbutff8, wo_rk-t>re,cvbuff, | N ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tiCdCL_,ALG O_nRINGt, hNCCLr_PeROTOa_SIdMPsLE,, 2) | ^w /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ork ); | ^ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hi:432d:78:( note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | t i ifd (t)id ,< s ubtn) RunWorkColl(nth)rea.ds(rnthrueands),( ttid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_P 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tiROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)L_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> primsp(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, Func/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> priSms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ umPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads( 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | PrimitiveIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78s, /*Dir/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(ect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, wor: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: CnthreaLds), t_idInBMlock(AthreaXdIdx._x), gDroup(Enote: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ V_ARITY>, /*Direct=*/0, Proto, 0> prgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreadsO_,SIM PLEw]/NoCCLr_STkEP); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> primorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRedusc | e ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h_:565:T5: Rnote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | E Erun_TreSeUpIDowMn, vCOL_L_UuNRO8LL>_(ti2d, ,nth reands,c wocrk)lFunc; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ kColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepS:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hfSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSi:670:ze_)15 { | : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.hwarning: :303:90: initializer order does not match the declaration order [-Wreorder-ctor]note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Di| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizreect=*/s0, Pr[NCCL_PROTO_SIMPLE]oto/, 0> Nprims C | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | if (tid P< subrtn) RiunWormkColli, 0, Proto, 0>COLL_ UNRpOL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hL:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tidrims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRi>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sumng(tid, nthreads, worPokstD)iv_u;8_2 , n ccl| Fu ^ncAl lR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.heduce, :432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollunc(Sum)Pos.tDivr,u uinnt8(_t,t NC, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ iCL_dALG,O_T REEs, NuCCLb_PRtOTOn_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h algo, pro:670:t15: owarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(st,e woprk)S; i| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cppz:22e:1:_ note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here =22 | D=EFI NE_0ncc lDe?vFu nc(nAllcRedcucel_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gShmreom.cuommp.bu(ffSgizers[NoCCL_uPROpTO_)SIM,PLE ]/ NCC| L_S ^~~~~~~~~~~~~~~~~TEP S/s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hizeof:(670T) :: s60tep:Siz enote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidI_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:ostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SI 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); MPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAll if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevF/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threaunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), gIn file included from roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ CL_MAX_DEV_ARITY, 1>, /*Direct=/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4:,670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] n670 | ticd(tidclFu), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10In file included from : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RIIn file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NG, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp10:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:111:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 111 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Broadcast_RING_LL128_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:111:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 111 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Broadcast_RING_LL128_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 1111 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx942. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1200. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, heaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ d, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx90a. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | In file included from stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tidIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_ST(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here In file included from 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, al/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ go, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 1010 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp 10 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:13: 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:145:14: warning: unused variable 'data1' [-Wunused-variable]: 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75: 7145 | : warning: unused variable 'w' [-Wunused-variable] ui75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cnt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ onst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from 35: warning: unused variable 'flag2' [-Wunused-variable] 145 | ui/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1n: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:32_t data1, flag1173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29, data2, flag2; | ^~~~~ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, f); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)=In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthr=3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMaxeads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ , double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: agThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_byconst int w = threadIdx.x/WARP_SIZE; \ | ^ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:14: warning: :unused variable 'data1' [-Wunused-variable] 145174 | ui: nt32_t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h data1, flag1, d:ata2, f75lag2; | : ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h7:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0ze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ > prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uinIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ t32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hu:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:i145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t int w = threadIdx.xIn file included from /WARP_SI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:Z1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: EIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145;:14: warning: unused variable 'data1' [-Wunused-variable] 145 | u\ | ^ int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 10/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthr10 warnings generated when compiling for gfx942. In file included from eads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from , n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hthr:ead13s(n: tIn file included from hrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hds):, t173idI: nBlo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hck(:thr670ead:Idx15.x):, g rouwarning: p(ginitializer order does not match the declaration order [-Wreorder-ctor]rou p) 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),, | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199: 57flagThre:ad((ti d%4)==3)note: , group(gin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereroup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize199(ncclShm | em.comm. buffSi zes[NCCL_PPROTO_LLr128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitimitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreteclRunInterpreter, ProtoSimple, fullOpsrype,( Fucnc##odevremdopm, ProtaoLL1l28, gfullOops>(,comm , alwgo,o worrk); k\); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid | ^ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, In file included from 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), ntop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREIn file included from DOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx942. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7In file included from : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx1102. In file included from 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp :ba1r: rIn file included from i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.her_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tWARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ id/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: 496In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h | :13 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670t:15i: dwarning: initializer order does not match the declaration order [-Wreorder-ctor](tid), nthreads(nthreads), wid(tid%WARP_SIZE), w670 | a r pti(d(titidd),/ nWtAhreadRs(nPth_rSIeaZdsE), )ti, d InB| lo ~~~~~~~~~~~~~~~~~~ck (t hr| eadIdx.x), group(group), stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ war| pI tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_nB 671lo | ck ( stepSize(stepSize_ == 0 ? ncclShmem.comm.bthreadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NufCfSCizLes_[NPCCRL_OPRTOTOO__SLIMLPLE1]/NCCL2_S8TE]P/SN/sCizCeoLf(_T)S :T sEtePpSSiz/e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> psirzeoif(muisnt 64 _| t) ^ ) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp{ | : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3 :| group(group1 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hnote: :in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here199: 57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here3 | 199 | M S CPrCimL_IMPL_Kitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax,ERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, ProtoLL128, fullOps>(comm, algoL,IC ESwTEoPSr, kIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpI2>), ;fu ll\Ops >( c| omn ^Block(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | In file included from MS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cppC:C1L: _In file included from I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hM:P13L: _In file included from K/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hE:R173N: E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hL:_670E:N15T:R Ywarning: _Finitializer order does not match the declaration order [-Wreorder-ctor]U NC_DEVREDOP_TYPE( M670i | nM a x , tuiidn(tt3i2d_)t,, nftahlrseea)d;s ( n| t^h reads), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.ht:i384d:I3n:B lnote: oexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'c k(threadI d384x | . xm)s,c cglRunInterpreter ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_P rotoLL128, 671f | u l l O psst>e(pcSoimzme,( satlegpoS,i zweo_r =k=) ;0 \? n| c ^c lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 10 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: :13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPIn file included from L_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from roup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(ntIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx942. | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: , Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthr warnings generated when compiling for gfx1030. eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp::1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. In file included from 10 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bar/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hr:ier_by_173group();: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | baIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from data2, fla/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:g warning: unused variable 'data1' [-Wunused-variable] 145 | 2 uint3;2_t data 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t da | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uitna1, fltag1, d3ata22, flag_2; | ^~~~~t /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28 : warning: unused variable 'data2' [-Wunused-variable] d145 | uaint32_tt dataa1, fla1g1, dat,a2, fl agflag1, data2, flag2; 2;| | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h ^~~~~:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 67110 warnings generated when compiling for gfx1201. | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter,In file included from ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread(| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175nt64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | trccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ id(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false);SCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Pr o | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ d, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | ^ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warphIreads),n tidInBBlock(lthreadIodx.x),c group(kgrou(p), t| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~hreadId | x tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_10 warnings generated when compiling for gfx1101. 671 | .x/WARP_SIZE), ste| pSize( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ste | warp(tid/WARP_SIZE 498 | pS ize_ = = 0 flagThread((tid ? %ncclShm4em.com)m.buff=S=iz3), group(group)es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h, | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | :199: 57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here P199 | Prrimitiives, 1, eProto,s 0> pr, ProtoSimple<2, 2, 2>, false>' requested here n3 | MSCCAL_IsymmMPeL_Ktric<1,ERN1EL_EN>, 1, ProtTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' o, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | 387 | msc clRumnIntserprcetecrt, ProetoSirmplep,C L_SLPICESTrEPS,o MStCCL_oSLICELSTEPLS, 21>, f28, ullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid)fullOps>(comm, algo, work); \ | ^ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1200. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:ata1, flag1, data2, flag2; | ^~~~~ 5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea3: note: ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' Size(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInB 384 | mscclRulock(threanIdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(:1): In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:;173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrie r_by_grou| p(); | ^~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h const int w = t:hreadIdx29:15: note: expanded from macro 'barrier_by_group' .x/29WARP_SIZ | E; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: gIn file included from roup();/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29unused variable 'w' [-Wunused-variable]: 75 | 15: note: expanded from macro 'barrier_by_group' 29 barri | e cor_by_group(); | nst ^~~~~~~~~~~~~~~~~~int w = t hre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: aexpanded from macro 'barrier_by_group'dIdx.x/W A 29 | RP_SIZE; const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from int32_t data1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:f lag1, warning: data2, unused variable 'data2' [-Wunused-variable]flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145145 | uin | t32_t d ata1, f lag1, d ata2, flag2; u| ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:i28: warning: unused variable 'data2' [-Wunused-variable] n145 | uitnt32_t3 data1, f2lag1, d_ata2, fltag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:d145:35ata1, :f warning: unused variable 'flag2' [-Wunused-variable] 145lag1, data2, flag2; | ^~~~~ | uint32_t data1, flag1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uintd3ata2, 2flag2_; | ^~~~~t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80o:5: warning: nunused variable 'w' [-Wunused-variable] s80 | t barr ier_biy_grounp()t; w = th| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | creadIdx.x/WARP_SIZE; \ | ^ onst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nt10 warnings generated when compiling for gfx1200. hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 49810 | flagTh warnings generated when compiling for gfx1201. read((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ PS, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: Mwarning: SCCfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]L_SLI CESTE PS, 2>, fullO496ps>(co | mm, algo, work ); tid(tid), nthreads( n\ | t ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hh:670:15r: note: field 'nthreads' will be initialized after field 'tidInBlock'e 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:a670ds), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threa:d60: note: Ifield 'group' will be initialized after field 'stepSize' d670 | x ti.d(tidx), n/thrWeadAs(RP_SIZE), n| thre ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ads) , ti dInB| lock warp(tid/WARP_SIZE( 498 | flagThread((tid%4)==3), group(group), thr eadId| x.x) ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~, g roup | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | ( gro up), s| ^~~~~~~~~~~ tepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~:60: note: field 'group' will be initialized after field 'stepSize' 670 | | tid tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_(ti d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunIn:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warptIerpreter, ProtoSimple,n fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oat, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, headIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ , mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp :1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 14575 | | bar rie uir_byn_groutp(); 3 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h2:29:15: _note: expanded from macro 'barrier_by_group' t29 | condst inat w =t threaadIdx1.x/W,ARP_ SIZE; f\ | l ^ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1 | : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7 : warning: unused variable 'w' [-Wunused-variable] 75u | i bnarrietr_by3_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =10 warnings generated when compiling for gfx1030. threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =10 threadIdx.x/W warnings generatedA when compiling for In file included from RP_SIZgfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: E; \ unused variable 'w' [-Wunused-variable]| ^ 75 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:291: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: | 13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barricer_byo_grnoup()s; | t ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29 :15: note: iexpanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75dIdx.x/WARP_S:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ IZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreadIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpps(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3:)1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, heaIn file included from d, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadId/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fa1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 80In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here ic<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm,199 | al g oP,r iwmoirtki)v;e s\< T ,| ^R edOp, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' :29 | const int w = threadIdx.x/WARP_SIZE; \ 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | u iconst intn w = thretadIdx.x/W3ARP_SIZE; 2\ | ^ _t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xIn file included from ), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/In file included from WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:]13: In file included from //builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:N670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] CCL_ S670 | tid(Ttid), ntEhreads(nPthreadSs), t/sizeofidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DE /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadVsREDOP_(TYPE(nProd, intt32_t, hfalse); r | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384e:3: anote: dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE's ) , tid384 | ImnBsclcolcRku(ntIhnreadIdx.x), grouterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ p(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:10 warnings generated when compiling for gfx1100. 384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(st1e: pIn file included from Si/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hz:e13_: In file included from =/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h=: 1730: ?/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :n670c:c15l:S hwarning: minitializer order does not match the declaration order [-Wreorder-ctor]e m.comm.buffSizes[NCC L670_ | P R O T Ot_iSd(ItMiPLdE)],/ NnCtChLr_eSaTdEsP(Sn/tshirzeeaodfs()T,) t:i dsItneBplSoiczke(_t)h r{e a | d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I d x| . group(groupx ), group(group), | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 199 :| 57 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 671199 | | P r ismtietpiSviezse<(Ts,t ReepdSOipz,e _F a=n=A s0y m?m entcrcilcSm,. c1o,m mP.rboutfof,S i0zes[N>C CpLr_iPmRsO T O| _ ^S IMPLE]//builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cppN:C3C:L1_:S Tnote: Ein instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested hereP S/size o3f | (MTS)C C:L _sItMePpLS_iKzEeR_NE)L _{E N | T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R Y _| F group(groupU NC_DEVREDOP_TYPE(Prod,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h :i199n:t576:4 _note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here, false) ;199 | | ^P rimit/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hi:v387e:s3<:T note: ,expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' R edOp, FanAsy m387m | e t rmisccn, I1n,t Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, woIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ erpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ \ | ^ 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | baIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagTIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, :u warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] i496 | tind(tid),t nthrea3ds(nthr2eads), _wid(tid%WAtRP_SIZE,), warp( tid/WARfP_SIZE), a | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) l se)497; | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384: war3pInBlo: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRcku(threadnIdx.x/WAIRP_In file included from ntSIeZrprEe), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:ter, IProtoLZL128, EfullOp)s>(comm,, algo , workw); \ a| ^ rp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) p), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto497 | , w arpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h 498 | flagThread((ti:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTOIn file included from _LL1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h8:13: In file included from ]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: //builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:N warning: initializer order does not match the declaration order [-Wreorder-ctor] C 670 | tid(Cti0Ld> p_)ri,Sms T | ^nEtP /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cppSh:3:/r1: senote: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here ai 3 | zdMSCesCL(o_IMnPfL_tK(ERhNELrea_ENdTsRY_)FUN,C _DEtuiintdV64IR_tnE))BD {l O o| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c_k T (threadIdx.x), groYPEu(Prpod,( guinrt32o_t,u fapl| group(groups) ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h e:199 :)57| :; ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here | | ^199 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hP ri:mit671384iv | :e 3s< T:, R enote: dsOexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'pt, e FpanAS384syize(stepSize_ == | 0msc clR?unI ntenrprcetecr. , cF1,ou Pmnrotmco,.# 0b#> udprfeimfvs Sr | ^ie /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cppzdeo:3sp:1[<: Ntnote: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereype>, Prot 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVRCCLE_PRDOTOO_SIPMPL_E]/TNCCLY_STPEPSE/si(zeoPf(Tr) : ostedpSi,ze_) { u | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i oLn L12t| 3 group(group8,2 f_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hutllO,:ps 199>(f:coa57mml:, s alenote: gin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here)o, ; w ork);199 \ | | ^ Primitives< T | ^ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: 384:3:R note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'e d384 | Op, FanAsymmetric<1 ,msc1clR>unI,nte rpr1ete,r prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FU#devredop, ProtoLL128, fullOps>(comm, algo, work); \ | ^ NC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThreaIn file included from d((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1_, Proto,P 0ROTO_> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hSIMPLE]:/NCC384L_STE:PS/si3zeof(:T) : stepSnote: izIn file included from expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpe_)r { | e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.he:199:57: rnote: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199< | type, Func##devredo Primitives, 1, Proto, 0> prims | p, P roto/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cppLL128, full:Ops>(3comm,: algo, 1work:); \ | ^ note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), _t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3 | ^~~~~~~~~~~ : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTIn file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> pri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670O_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclmSs | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] | ^~~~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlocIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRuIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: nInterpreter, Proto Simple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ TEPdS, MSCCL_SL(ICESTEtid), nthreads(nthreads), tidInBlock(threadIPdS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Prthreadis), tidmInBlocik(threatdIdx.xi), grouvp(grouep), | s ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670<:60: note: field 'group' will be initialized after field 'stepSize'T 670 | , tid( tid), Rnthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTOIn file included from _LL128]/NCCL_STEPS/siz/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid)eo, nthreads(ntf(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNELh_readEs), wiNd(tid%WTARP_SIRZE), wYarp(ti_d/WARFP_SIZEU), | ~~~~~~~~~~~~~~~~~~N | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) C497 | w_arpInDEVREDOP_TYPE(Prod, uint6Block(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 4_t warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3, fals e); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3:499 note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | 384 | msccl RunIn ter pretesr, ze(nccProtoLL128, fullOps>(comm, algo, work); \ | ^ lShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmtepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(ProdIn file included from ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from Idx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp :1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hi:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: w = threadIdx./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from Id/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:x1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13.: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: /warning: unused variable 'data1' [-Wunused-variable] 145 | W uiAnt32_t Rdata1, flag1, data2P, flag_2; | ^~~~~ S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21I: warning: unused variable 'flag1' [-Wunused-variable] Z145 | E; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, u; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498int8_t, false); | ^ :29: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid) | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WAR]P/NCCL__STEPSS/sizeoIf(T) :Z stepSiEze_) {) | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | , group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: 199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here | 199 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Prim itives , 1,498 | flagThread( Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTR(tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Y_F UNC_DPEVREDrOP_TYPE(Prod, iuintm8_t,i falste); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3:i note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' v387 | emscclsRunIn, ProtoSimple, S1, PrToEPS,to, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); 2> , fu| llO^ps> (co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hmm,: al384go, :wor3k):; \ | note: ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE':670 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 384670 | | tid (timd),s nthcreacds(lnthRreauds)n, tIidInnBltoerpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tidIn file included from (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.xIn file included from /WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13 | : flagThrIn file included from ead/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h(:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.ncclcShmem.coomm.buffSmizes[NCCL_mPROTO_LL.128]/NCbCL_STEPS/suizeof(uintf64_t)) { fSizes[NCCL_PROTO_SIM | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ P| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hL:199:57: Enote: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here ]199 | Pri/mitiveNs, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSzeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_CCL_IIMPL_KEMRNEL_EPNTRYL_FUNC__DEVREDOKP_TYPERNEL_ENTRY_FUNC_DEVREDOP_TYPEE(Prod(, uinPt8_tr, falose); d| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h,:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOp uinst8_t, >false();comm, algo, w | o^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:r387:3: note: kexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' )387; \ | ^ | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL12In file included from 8]/NCCL_ST/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:E1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:S15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/ | sizeof(uint64_t)) { t| id(tid) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, nthrea ds(nthre | a group(groupd /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: s), note: tidInBlocin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herek( 199thre | PrimiadIdxt.x),ives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, ProtoLL128, fullOps>(comm, algo, work); \ | ^ >, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from t int w = threadIdx./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uintIn file included from 3/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:213: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: _/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: tunused variable 'w' [-Wunused-variable] 80 | barriedr_by_graoup();t | x/W ^~~~~~~~~~~~~~~~~~AaRP_S IZ1E; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h\ ,| ^ f:29lag1, data2, :15: fnote: expanded from macro 'barrier_by_group' 29lag2 | c;onst int In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_lag2;g | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hr:145:oup(); | ^~~~~~~~~~~~~~~~~~ 28: warning: unused variable 'data2' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h 145 | uin:t32_29t data1:, fl15ag1, d:ata2, note: expanded from macro 'barrier_by_group' flag2 ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | 29 uint32 | _t data 1, flag 1, data 2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> p10 warnings generated when compiling for host. rims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cppd/WAR:P_SIZE),1 | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) : 497 | wIn file included from arp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: InBlock(threadIdx.x/WARP_SIZE), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread(( 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizetisd%4)==3[), grouNp(groupC), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ C | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | L ste_pSize(ncPclShmemR.comm.bOuffSizeTs[NCCL_OPROTO_LL_128]/NCSCL_STEPSI/sizeofM(uiPLE]/NCCL_STEnt64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | PrimitivePsS/sizeo, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims mm| etric ^<1,1>, 1, Pr oto,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp 0> prims | ^: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:33:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSC:CL_I1MPL_K:ERNEL _ENTRY_note: FUNC_in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested hereDEVRE 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_DOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter#, Pr#otoLdL128e, vfullrOps>e(comdm, aolgo,p, ProtoSimple wo, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:17475: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 10 warnings generated when compiling for gfx1201. 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cppeter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:In file included from 1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g10roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIdx.x/WARP_SIZE; \ | ^ :13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from ZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670, :Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),In file included from group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cppi:d1): ,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hn:t13h: rIn file included from e/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.ha:d173s: (n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht:h670r:e15a:d swarning: ),initializer order does not match the declaration order [-Wreorder-ctor] tidInBlock(threadIdx.x )670, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~r eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ HUNKSTEPS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 670 | tid(tid), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ , group(group), | ^~~~~~~~~~~ comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(talgo, work); \ | ^ id), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunIn file included from Interpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp 10 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1I: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:d173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: xwarning: unused variable 'w' [-Wunused-variable] 75 | . barxrier_by_gr/oup(); | W ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:A15: note: expanded from macro 'barrier_by_group'RP_SIZE; 29 | \ | ^ const int w = threadIdx.x/WAR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1P: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from _/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18:S warning: unused variable 'y' [-Wunused-variable] 77 | I In file included from uiZnt32_t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ yE, head, ma;ntissa; | ^ \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from dIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from t int w = thr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h :12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:f77:18: warning: unused variable 'y' [-Wunused-variable] l77 | a uint32_t gy, head, man2tissa; | ^; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_byIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const barriier_by_gnroup(); t | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: 29:15: note: expanded from macro 'barrier_by_group' w 29 | const i=nt w = th readIdx.xt/WARP_SIhZE; \ | ^ rIn file included from eadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ncclShmem.coIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(ntIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ constIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:y_29group();:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:1529:15: :note: expanded from macro 'barrier_by_group' 29 | const note: int w expanded from macro 'barrier_by_group' 29 | = thre adIdx. x/WARP_S IZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP| ^ _SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRuIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, woid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | , stepSiz e(stepSiz1e_ == 0 ? ,nc Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cppclShmem.comm.buffSizes[N:3:1CCL_PROTO_SIMP: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSLE]/NiCCL_mSTEPS/psizeolf(T)e : ste, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | PriCmitivesL,N 1, KProto, S0> priTms | ^ E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:P3:1: note: Sin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | M/SCCL_IMMPL_KSCCL_SLICERNEL_ENTRY_FUNC_DEESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(VREDtOP_TYhPE(Surm, inet64_tadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm,hreads(nthreads), tidInBlock(threadIdx.x), group(grou algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread(In file included from (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1read((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13:: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]note: 670 | in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here tid(t id), nth reads(nthrea199ds), tid | InBlock( threadId x.x), grPoup(grroimitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_Dvredop, ProtoLL128, fullOps>(comm, algo, worEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterprIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(teter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVRhreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPEDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | PrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: imitivesinitializer order does not match the declaration order [-Wreorder-ctor], 1, Pro670to, 0> pri | ms | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp :3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSC CL_IMPL_ tid(tid), nthreadsKERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInt(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizeerpr(eter, SProtoSimipleO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ , ful| lOps group(group>(com m, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.halgo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitivesds), ,tidIn Block(1threa,dIdx.x ), grProto, 0> proup(igroupm), | s ^~~~~~~~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVRE /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hD:670:60: Onote: field 'group' will be initialized after field 'stepSize' P670 | _ tidT(tid)Y, nthPreadsE(n(Sum, int8_t, fathrleadss), teidIn)Bloc;k(th read Idx.| x), ^grou p(g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hroup), :387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRu | ^~~~~~~~~~~ nInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t da_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ata1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: In file included from unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hIn file included from :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]: 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp 11 warnings generated when compiling for gfx942. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(RIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ educe_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFunthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiRING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: In file included from warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp: 1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h: 15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | ui nt32 uint32_t y, head, mantissa; _t y, hea d, mantis| sa; | ^ ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15 uint32_t: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h: 77:18: y, warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head,In file included from head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: mantiIn file included from ssa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:28: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: unused variable 'data2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | b145arri | uer_by_groint32_t data1up(); | ^~~~~~~~~~~~~~~~~~, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29: flag1, d15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cppag:1, data22, f: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dat/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uIn file included from int/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp3:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from 2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:_7: warning: unused variable 'w' [-Wunused-variable] 75t | ba rrier_by_gdroup(); | a ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15ta1, flag1, da: In file included from note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h145:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from IZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:f2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11l: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145a:14: warning: unused variable 'data1' [-Wunused-variable]g1, da t145 | a uin2t32_t data1, flag,1, dat a2, fflag2; l | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:a145:21: warning: gunused variable 'flag1' [-Wunused-variable] 145 | 2 ui;nt32_t datIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | a1 ^~~~~, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | u flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, datain2t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t d,a flatg2; | aIn file included from 1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_b ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hy:145:35:_ warning: unused variable 'flag2' [-Wunused-variable] g145 | r uiont32u_t dapta1, (flag1), da;ta2, fla g2; | | ^~~~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bar r consit inte w = rthre_adIdxb.x/WARyP_SIZ_E; \group(); | | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h::175: 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from 29/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h: 11 prim: s(tid, ntIn file included from hreads, &/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hring->pre:v, &ring-173>: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] next, work->se670 | ndbuff, work->recvbuff, work->redOpArg, 0, work tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? -ncclShm>connIendex, womrk->conn.Index);c | ^o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63m:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here m63 | r.unRingS(tizes[NCCLid_, nthrePadROTO_SIMsP, workL); E]/NCCL_ST| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: EPS/sizein instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work-> RredecOp,v Algob, Prouto, COfLL_UNfROLL>,().ru n(tidw, soubtn, rwork)k; | ^- /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:>12:1: note: rin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12e | DEFINdE_nccOlDevFpunc(RAeduce_rRING_g, 0, work->connIndSIMePLE_MxinM,ax_u3 2_4,work->c ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_onnIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlocL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE)63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redIn file included from OpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().lrun(tSidhmem.comm.buff, suSbtn, wiork); z | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cppe:7:1:s note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here [7 | DENFINE_nCcclDevFCuncL_PROTO_SIMP(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FunLcE]/NCCML_STEPiS/sizenof(T) M: stepSaize_) x{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ,| group(group uint32_t, NCC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, woL_ALGOr_RIk->rNG, eNCCL_cPROTO_vSIMPLbE, uff, work->redOpArg, 0, work->connInde2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), x, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid,tid InBlonck(thrteadIdhx.x),r groeads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | uip(gfrou p)(tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here In file included from 77 | runRing(dtid, nt)hreads, nthreads(nthread, worsk); | ) ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:,78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | ifw (tid ().rIuZE), nwar(tid, sp(utid/WbARP_tSIZE),n | ~~~~~~~~~~~~~~~~~~ , | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | w waropInBlrock(tkhread)Idx.;x/WAR P_ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFSIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncINE_cncclDlevFunSc(Reduhce_RINmG_LL1e28_MimnMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: .cnote: omm.bexpanded from macro 'DEFINE_ncclDevFunc'uffSi zes[ NCCL_PR611OTO_LL128]/N | CCL_ STEPS /sizeo f(uin t64_tR)) { u | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hW:33orkBatch, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | r priemsdop, (tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, woarlgo,k prot-o, un>roll>(r).rune(); \ | ^ dOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidIn file included from 432 | if (tid < subtn) Run/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:Wo2rkColl: ().(t:rid), u173nthren: ads(nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h(hre:tads),670i tid:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread,d subtns, work)); | ^ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1 : note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested heret 7 | iDEFINE_dncclDIInBleocnk(tvhrBeadFIdlx.xu), ogronupc(gcroukp), ( | ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_t 671 | he sdrtepSizuee(stecapSizee_d == 0 I_? nccRdlShmeIxm.comm.N.buffx), group(group),G_SIMPLE_MinMax_u32_2, ncclSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s sttepSieze(pstepSSFuniicRezzduceee, __Fun) cMin =Ma=x, u int320_t, N? ncclShmem.comm.bufCCLf_ALGSO_RIiNG, NzCCL_ePROTsO_SI[{ MN| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ PC | group(group CL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hLE:33_,:7:P note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereR2 O)33 | T O p_| rim^Ss(t Iid,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hM ntPhL:611rEe:]/NCCL_STEPS/siz62: enote: expanded from macro 'DEFINE_ncclDevFunc' o611 | f R(unWoTrkaB)dsa, &rti:ncg- >hprse, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here ll , ty,33 re | dop , alg o, pro to,p runrioll>m().sr->(nuetxnti,( dw)o,r;k - >ns\etn dhb urf| fe, ^a wodrk s->/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h,re cvb&:ufr670:f, 15wo:rik -n>note: redfield 'nthreads' will be initialized after field 'tidInBlock'OpA rg, 0, wo670rk- | >co nnIndex, work->cg-o>prnev,n &rIingn->nd ee txxidt)(t,;id ),w nthr| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: ork->sendbuff, work->recvbueadfs(nfthr,ead s),w tiodInrBlokck(-thr>eadIrdx.ex),d gnote: Oin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested herer p o63Au | rp g( r,gun rR0oin,ug< pT,w) Ro,edr Opk , -| Pr> ^~~~~~~~~~~~~~~~~otc o,o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h nCOLn:L_UI670NRn:OLd60L>e:( xtinote: ,d,field 'group' will be initialized after field 'stepSize' n wth orrk->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | run670 | R tiid(ntidg)<, nTthr,ead s(nRthreeadds), OtidpInB,loc k(tPhreradIodx.tx),o gr,eoa dusCp, Ow(oLrgkL)r;_ o | Uu ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hN:432Rp:78O)L: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereL >,(432 | t i d| , ^~~~~~~~~~~ if n(tid < subtn) RunWorkColl().run(tithrdeads, ,wor k);s | u ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | ibf (tid < subtn) RunWorkColl, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRingconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work)OLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xRunW), group(grouporkBa)tch, alg| o, pro ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~to, u nroll>() .run();| \ | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid671(tid), | nthre ads(nt hread s), ti dInBlsock(thrteaepSize(stdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | epSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :7: note: tiin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | primd(tid), nthreasds((nthtreadis), dtidI,nBlo ck(thnreadtIdx.hx), rgrouep(graoudp), s | ^~~~~~~~~~~ , &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid432 | if (tid < subtn) RunWorkColl().ru),n nth(readts(ntihreadds), ,ti subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: dIin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herenBlo ck(thre adIdx.x)12, gro | up(gDroupE), F| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ I| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ NE_ncclDevFunc671 | ( stRepeSized(steupSizce_e == _0 ? RncclIShmeNm.G_SIMPLEc_omm.MbuffiSinMax_f8_4, ncclzFes[uNCCLn_PRcOTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62e:pSi ze_)note: { expanded from macro 'DEFINE_ncclDevFunc'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:6117: note: | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | pri ms(tRunWorkBatchid,< nthcreadso, &lringl->pre,v, & ringt->neyxt,, red oworkp-, algo, proto, u>snendbruff,oll>(). rworku->recnvbu(ff, )work;->re d\ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15O:p note: field 'nthreads' will be initialized after field 'tidInBlock'Ar g, 0, 670wo | rk- >connIndex, work->connIndex); | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:t63:5i: note: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here( t63 | riunRding)(t(id,n ntthreahds,r woerak);d | ^s /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h):432:,78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclD/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou)p; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ OTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, wor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk->/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), 10 warnings generated when compiling for host. nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_byIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCLIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' _PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h, :2670): 15 :| ^warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 670611 | | tRiudn(Wtoirdk)B,a tncthhd,I naBllgooc,k (ptrhorteoa,d Iudnxr.oxl)l,> (g)r.rouupn((g)r;o u\p ) ,| ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 671 | st e670p | S i ze ( tsid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 432 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.com:670:15: mwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid,.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670: nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCLIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_RING, NCCL_PR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidNCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, wor(tikd), nt-hreads(>nthreadrs),e tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stecvbpuff,S work->iredOpAzrg, e0, wor(k->constepSize_ == 0 ? ncclShmem.commn.Indbex, wuork->fconnfIndeSx); | i ^ z/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here es[NCCL_PROTO_SIMPLE]/NCCL_STEP63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33 :432 | 7 : i f (tinote: din instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &r< siubtnn) RunWgorkC-olldOp,p Algore, v, &ring->next, workPr-ot>o, COsLL_eUNROLLn>d().rubnu(ftfi,d, wsoubtnr,k -w>orrekc)vbu; f f, work-| ^> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cppr:7:1edO: note: pin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here Arg, 0, work->c7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_onnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: PreMulSin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested hereum_bf1 6_2, nc63 | cl F un crunRRiednugc(tid,p_bfl oat16, nNCCL_tALGhreadsO_RI, work); NG, | NCCL ^_PROT /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hO_SIMP:432:78LE, :2) note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :611:62 : note: expanded from macro 'DEFINE_ncclDevFunc' if (tid < s611 | RunWorkBatch, aubtn) RunWorkColl(L).run(>); \ ( | ^ )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15.:ru n(tid, subtnnote: field 'nthreads' will be initialized after field 'tidInBlock' , 670 | tidw(tork);id) , n | ^thr /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cppead:7:1:s(nth reanote: din instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here s) ,7 | DtEiFdINInE_nBccllDevFunc(Redouck(thcreadIedx._x), gRrIoNG_SIMPLE_PreMulSumu_p(gbroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h, nthreads(n:thr611eads), tidI:n62Blo:ck(th readInote: dx.expanded from macro 'DEFINE_ncclDevFunc'x), g roup(group), | ^~~~~~~~~~~ 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15In file included from : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11/WAR: P_SIZE;In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: \ | ^ In file included from 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | :uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groupIn file included from ();/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for gfx906. 10In file included from warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group10 warnings generated when compiling for gfx1030. ), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, n) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for gfx1102. 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | In file included from ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ algo, p r671o | t o , usntreoplSli>z(e)(.srtuenp(S)i;z e\_ = =| ^0 ? ncclShmem.comm.buffSizes[NCCL_P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hRO:T670O:_15S:I MPnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'E ]/NCCL_STEPS/size o670f | ( T ) :t isdt(etpiSdi)z,e _n) t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hh:r33e:a7d:I dnote: xin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here. x), group(gr o33 | u p ) , | ^~~~~~~~~~~~~~~~~p rims(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ht:i670d:,60 :n tnote: hfield 'group' will be initialized after field 'stepSize'r eads, & ri670n | g - > p rteivd,( t&irdi)n,g -n>ntehxrte,a dwso(rnkt-h>rseeanddsb)u,f ft,i dwIonrBkl-o>crke(ctvhbruefafd,I dwxo.rxk)-, group(group), | ^~~~~~~~~~~ >redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cppe:2: In file included from r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: _In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hb:145:y14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32__t gdatra1, flag1, odataup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ .x/WARP_SIZE; \ | ^ 28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndexIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIPROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | iIn file included from f (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.heduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 611670 | | tid(t id), n t hreads (nthreRads)u, tidInnBlWock(tohreadIrdx.x)k,B group(garoup), tch, | a s ltepS gi z e(stepStiziedo_(, tid), nthreads(nthreads), tidInB proto, unroll>().run(); \ | ^== 0 ? ncclShmem.comm.buffSizes[NClock(threadIdx.x), groupCL_PRO(TO_SgIMPLrE]/oNCCLu_STEpPS/s) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hi,:670: z15: enote: ofield 'nthreads' will be initialized after field 'tidInBlock' | 670 | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f t (id(Ttid)), n t| hr:e tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ad s(n sthtreea671pS | ds i) ze, _)ts idt{Ie np | BlS ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~oi zc| e group(groupk( /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hs(ttherpe:aS33idI:zdxe7:_. x) note: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here = gro=33 u0 | p (? g rn ouc c p)l ,Sphr i m| em ^~~~~~~~~~~~~~~~~sm( ./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.htcoid:m,670:m .n60:bt hunote: frefield 'group' will be initialized after field 'stepSize' fSiaz edss670[ | , N C C&rL_PROTO_SIMPLEi]ng-/>prNev, C&rinCg->nLex _ttidS(,tiTd ),E wntPhroeaSdrs(/ntkhsre-aids>)z, steideoInBfl(noTd)c b: ufsfkt, (etpwhSrieoazredIk_-d) >xr.{ex c) v, bgur| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group ff,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h: wo33rk->:r7eoup:d(gr Onote: poupin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereA), r g,| ^~~~~~~~~~~ 33 | prims(tid, nthreads0,, wor k->c&onnIrndex,i wnork-g>con-nInd>perev, &ring->nextx);, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63 :5: wonote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested herer k- 63>s | e n d bruuff, work->rencRingvLrLe_UdNORpOALLrg>,(t i0d,, wnotrhrke-a>dsc,on wnoIrkn)de; x | ^ ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h wo:rk432-:>c78:o nnote: nin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here I nd432e | x ) ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here if 63(tid | < s ubt n) R unWorrkCoulnRli(tid, nthreads, work); | ^L_ UNROLL>/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h().run(tid, subtn:, 432work:); | ^78: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here:7:1 : 432 | if (t note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herei 7d | DE FI().run(tid,FunIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0,cReduce, FunscPreuMulSbum, tflona,t , wNorCk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12CL_:ALGO1_RIN:G, N Cnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMCLP_PROLTO_SIEM_PLE, P2) r| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.he:611:M62: note: uexpanded from macro 'DEFINE_ncclDevFunc' 611l | S RunuWorkmBatc_hf32_4, ncclFuncReduce, FuncPreMul, algof, prloto,o uanroltl work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ,> (N)C.CruLn_(AL)G; O\_ | ^RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatc h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h<:670c:15: onote: field 'nthreads' will be initialized after field 'tidInBlock' l670 | l ti,d(ti d), ty, nrthreeadds(ntohpread, algo, protloock(t,hrea dIdxu.nx),r grouop(lgrolup),> | ^~~~~~~~~~~~~~~~~ (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:)60: note: .field 'group' will be initialized after field 'stepSize' r670 | u tind((t)i;d )\ , | n ^t h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hrea:670d:s15(n: tnote: hrfield 'nthreads' will be initialized after field 'tidInBlock'e ad670s | ) , tid(tid), nthreads(nthreads),tidInBlock(threadIdx.x) tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid, group(gro(up),t i | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t), itidInBdlock(th(readIdtx.x)i, groupd(grou)p),, | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RIN warnings generated when compiling for host. G, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Arg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 1010 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1200. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, m/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1a: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15n: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14t: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77i:18: warning: unused variable 'y' [-Wunused-variable] 77s | s uiant32_t; y, he ad, man| tissa ^; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from 11/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h warnings generated when compiling for gfx942. :12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2int : w = thIn file included from readIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h.x/WAR:P_SIZ11E; \ : | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads, nth(reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f64_2, ncclFuncReduce, Fun| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbufIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gf, workr->redOpoArg, 0, uwork->cponnInde(x, work->gconnrIndex); o | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hu:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested herep 63 | ) runRin,g(tid, nthreads, work); 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 432 | if ( 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.com:670:15: warning: minitializer order does not match the declaration order [-Wreorder-ctor] 670 | . tid(tibd), nthreads(nthreadsu), tidInBlfock(threadfIdx.x), grSoup(groupi), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | zstepSize(stes[NepSiCzCL_Pe_ ==R O0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->send/NbCCL_STEuPS/sizefof(Tf) : step,Size_ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w | group(group o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33r:7: note: kin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | - pri>ms(tid, rnthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ecvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre:611:62: ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRingnote: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFFuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ INE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCIn file included from CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prehreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ v, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: dop, algo, proto, unroll>().run(); \ | in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ up), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:5:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run algo, proto, unroll>().run(); \ | ^ (tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:_t dat75a1, flag1:, data2, f7lag2; | ^~~~~: warning: unused variable 'w' [-Wunused-variable] 75 | barrieIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7f: warning: unused variable 'w' [-Wunused-variable] l75 | a barrierg_by_group1(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:,29:15: note: expanded from macro 'barrier_by_group' 29 | d const aint w = thtreadIdxa.x/WARP_S2IZE; \ ,| ^ In file included from flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xE/; \ | ^ WAIn file included from RP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t da const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, protreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBloco,k unroll(>().run(t); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'h 670 | rtid(tid)e, nthreaads(nthreadds), tidInIBlock(tdhxr.exa)d,I dgxroup(group),.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | re ads), tidIn Block(thr ead tid(tIdix.x), grd), nthreads(nthreads),oup(gro up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | idInB sltoepSize(steck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufpSfize_ == 0S ? ncclShmiem.comm.buffSizes[NCCL_PROTO_SzIMPLE]e/NCCL_STEPS/ssizeof(T)[ : stepSizNe_) C{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupC /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33L:_7P:ROTO_SIMPLE]/NC note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here C33 | primLs_STEP(tidS/sizeof(T) , n:threads, &risntepSize_) {g->prev, &ring->nex | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->rt, weork->sendcbuff, wvork-b>recvubuff, fwork->rfedOpArg,, 0, wo rk->connwIndex,o work->connInderx); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hk:63->redOpArg, 0,:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | w runoRrk->connIningdconnIOnp, Prdoto, COeLx)L_; UNROLL>(ti | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, w note: oin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | r k if (t)id < subtn;) RunW o | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432r:kCo78ll, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().rgun(otid, ,subtn, w Prork); oto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: Dnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here E12 | DEFINE_FncclDevFunIc(ReduceN_RING_SIEMPLE_Pre_MulSum_un32_c4,cl nDcevclFuFnunc(Reduce_cRedRuce, IFNG_SIMPLE_PreMuulncPreMulSSum, uinut3m2_u32_4, ncclFun_ct, NCCL_ALRGO_RINeGd,u cNeC,C LF_uPncPreMulSum,ROT O_SIMPuLE, i4n)t 3 2_t, NCCL| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' _ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 611 | a RunWolrkBatchg, algop, proto, runoroll>().trun()o; \ | ^, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hu:670:15: nnote: field 'nthreads' will be initialized after field 'tidInBlock' r670 | o tid(tild)l>().run(); \ | ^, nthreads(nthreads), tidInBlo /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'ck( threadId x.x670 | ti), group(group), | d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff,10 warnings generated when compiling for host. work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h::1512: In file included from : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from ag/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp2:;2 : In file included from | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h ^~~~~: 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h::145145::1421:: warning: warning: unused variable 'data1' [-Wunused-variable]unused variable 'flag1' [-Wunused-variable] 145 | 145 | u i ntu3i2n_tt3 2d_att ad1a,t af1l,a gf1l,a gd1a,t ad2ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145, | f l a gu2i;n t 3| 2 ^~~~~_ t data1, fla/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hg:1145,: 28d:a twarning: aunused variable 'data2' [-Wunused-variable]2 , flag 2145; | | ^~~~~ uin/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.ht:31452:_28t: dwarning: aunused variable 'data2' [-Wunused-variable]t a1, f l145a | g 1 , duaitnat23,2 _ftl daagt2a;1 , | f ^~~~~l ag1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h,: 145d:a35t:a 2warning: ,unused variable 'flag2' [-Wunused-variable] flag2 ;145 | | ^~~~~ ui/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hn:t14532:_35t: dwarning: aunused variable 'flag2' [-Wunused-variable]t a1, fla g1451 | , d a tuai2n,t 3f2l_atg 2d;a t a| 1 ^~~~~, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work-In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from >redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2| : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from ^~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread\ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:60: note: field 'group' will be initialized after field 'stepSize' tid(tid) , nthreads( nthreads)670, tidI | nBl ock( thr tid(teadIdixd), .nx), tghreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:nthreads), tidInBlock(thread670:15: Idx.x), group(gwarning: initializer order does not match the declaration order [-Wreorder-ctor] roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ = 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRingSIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | baIn file included from rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, daIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611In file included from | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Red/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizuce_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here e_R ING_SIMPLE_Prod_f16_2, ncc12 | DElFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 runRing(tid, nthreads, work); | ^ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 :| 78 group(group: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here432 | if ( t33i | d < s u bptrni)m s(RtuindW,o rnktChorlelaOppr,e vA,l g&or,i nPgr-o>tnoe,x tC,O LwLo_rUkN-R>OsLeLn>d(b)u.frfu,n (wtoirdk,- >sruebctvnb,u fwfo,r kw)o;r k -| > ^r edOpArg, 0, wor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cppk:-12>:c1o:n nnote: Iin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested heren dex, work- >12c | oDnEnFIInNdEe_xn)c;c l D| e ^v Func(R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.he:d63u:c5e:_ Rnote: Iin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested hereN G_SIMP L63E | _ P r o dr_ufn1R6i_n4g,< Tn,c cRleFduOnpc,R ePdruocteo,, FCuOnLcLP_rUoNdR,O LhLa>l(ft,i dN,C CnLt_hArLeGaOd_sR,I NwGo,r kN)C;C L _| P ^R OTO_SIMPLE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h,: 4324:)78 : | note: ^in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :432611 | : 62 : note: expanded from macro 'DEFINE_ncclDevFunc' if (tid 611< | s u b tRnu)n WRournkWBoartkcCho, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:In file included from warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:unused variable 'w' [-Wunused-variable]2 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h: 8080: | 5 : warning: bunused variable 'w' [-Wunused-variable]a rrier_by_gro u80p | ( ) ; b| a ^~~~~~~~~~~~~~~~~~r rier_b/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hy:_29g:r15o:u pnote: (expanded from macro 'barrier_by_group'); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 29 | c o ncsotn sitn ti nwt =w t=h rtehardeIaddxI.dxx/.WxA/RWPA_RSPI_ZSEI;Z E\; \| ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thrf(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.hw = threadIdx.x/WARP_SIZE; \ | ^ :80In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, :flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hfSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from onst i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2n: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:t11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: 173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:w75:7: warning: unused variable 'w' [-Wunused-variable] = 75 | t bahrrierr_bye_groaup()d; Idx.x/WARP_SIZ| E ^~~~~~~~~~~~~~~~~~ ;/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29: 15: note: \expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp :2| : ^In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h :11: In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_g/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: rIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11o: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174u: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:p7: (warning: unused variable 'w' [-Wunused-variable] )75 | ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: barrier_by_group(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grnote: expanded from macro 'barrier_by_group' o29 | const uint pw = (threa)dIdx;.x/W ARP_ SIZE| ; \ ^~~~~~~~~~~~~~~~~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in);t | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:w29:15: note: expanded from macro 'barrier_by_group' =29 | ctonst hiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt rw = ethreaadIdxd.x/WIARP_dSIZEx; .\ x | ^ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from 14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cppwarning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:: 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hwarning: :11: In file included from unused variable 'flag1' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145: 14: warning: unused variable 'data1' [-Wunused-variable] 145145 | | u int3 2_t data1 , fluag1,i datna2, tflag32; 2| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h_:145:21t:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp: 2: In file included from warning: d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11unused variable 'flag1' [-Wunused-variable]a: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h t:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.ha:145:1 14: ,warning: unused variable 'data1' [-Wunused-variable]145 145 | f | l uain tg32_ 1t da,uta i1, dnflaatg1ta2, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75flag2; | :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 3 2_t da145ta1, | fla g1, data2 , fl ag2;u | ^~~~~ i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145n:, dt28ata3:2, 2 fla_warning: g2;tunused variable 'data2' [-Wunused-variable] | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hd :145:21145: awarning: | unused variable 'flag1' [-Wunused-variable] t 145a | 1 u,i nt3uf2_til danata1tg, f31la2g,1 data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , dat_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:2:In file included from 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: : warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hunused variable 'w' [-Wunused-variable]:11 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: 75:775: | warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp: 2b : a In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h r:br11ai: reIn file included from rr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hi_:eb175yr: __b/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:y80g_:rg5or:o uuwarning: pp(unused variable 'w' [-Wunused-variable]( )) ;; 80 | | | ^~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h b:29a::r2915r::15i: note: expanded from macro 'barrier_by_group'note: e expanded from macro 'barrier_by_group'r _by2929_ | | g r o u pcc(oo)nn;ss t t| i ^~~~~~~~~~~~~~~~~~ nint t w /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h w=: 29=t: h15tr:he readaInote: ddexpanded from macro 'barrier_by_group'Ix In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ d x.x.29/x | W/ AW R PA _RcoSInPZs_EtS; I iZ\n Et ; | ^ w\ = threadI d| ^x .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:29 warning: unused variable 'w' [-Wunused-variable] 75 | | ba rrier_by _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29 const :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t145 | uindt32_t dataa1, flag1t, data2, aflag2; 1| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:,21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uintf32_t data1,l flag1, adata2, fglag2; | ^~~~~1 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28, data2,: warning: unused variable 'data2' [-Wunused-variable] flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145 145: | uin28t32_t da:ta1, warning: unused variable 'data2' [-Wunused-variable] fl145 | ag1, uintdata23, 2flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ _t data1ag2; ,| ^~~~~ f /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step S | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_ProIn file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp15:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInd_f8_B2, ncclFuncReduce, lFuncProd, roccl_float8c,k(threadIdx.x), grou NCCLp_ALGO_RING(, NCCL_Pgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ROTO_LL128, 2) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize( ste| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hp:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' S611 | Rize_ == 0 ?unWor kBatchncclShme, zes[NCCLalgo, proto, unroll>().run(); \ _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:| ^ 33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, workIn file included from );/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h :11: In file included from | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h ^:670:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp 670 | :5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f 8 tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepS_i2, nczclFunecRedu(ce, FsuncPtrod, ercclp_floatS8, ize_ == 0 ? ncclShmem.comNCCLm_ALGO._RINbG, NCuCL_PRfOTO_LfL1Siz28, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch: st, algo, proto, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hunroll>().run(); \ :670:15:e pSizewarning: _) { initializer order does not match the declaration order [-Wreorder-ctor] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h :33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 67033 | | p rims(tid, nthreads, &ring->prev, &ring->| ^ next, work->send b tuidf(tfid),, n thrweados(nrthrkead-s),> tirdIneBlcocvk(tbhruff, work->redOpAreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[g, N0, work->CconnCIndLex,_ woPrk-R>coOnnITndeOx);_ | ^S /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hI:63:M5: Pnote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here L 63E | ] r/unRNingC(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunW) o: srtepkSizCe_)o { l | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l <| group(group F/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:n33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recv,b T,u RefdOpf, A,lgo , Pwrotoo, CrOLL_kUNR-OLL>>()r.ruen(tdid,O supbtnA, worrkg);, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp0:7:,1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here w 7o | DErFkIN->connIndexE,_nc clDwevFouncr(Rek->connIndex)duce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, ; | r ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hc:63c:5:l note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here_ 63f | l ruonRaingt(LtGO_id, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollE, (2) ) | .^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), truni(tidd, Isubtnn, Bworlk);o | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prock(dthr_eadfIdx8.x)_, g4rou,p ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | (group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &10ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, wo warnings generated when compiling for host. rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] u670p), | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce,em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | >(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | ui/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' nt32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h::432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().rOun(tiTd, subtn, wOork); _ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cppS:12I:1: note: Min instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12P | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTLE]/NOCCL_S_TEPS/SsizeoIf(T) M: stePpSizeL_) { E| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, wo, 4) r | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hk->sendbuff, work->recvbuff, work->redOpArg, 0, w:611:o62: note: expanded from macro 'DEFINE_ncclDevFunc'r 611k | RunWo-rkBa>tch, nalgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gdex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work-, tidInBlock(threadIdx.x), group(gro>uconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | tDid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ EFI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_hreadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hx:33:7: note: )in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | , pri ms(tid,g nthreards, &roing->pruev, &ripng->nex(t, wogrk->senrdbuff, owork->ruecvbuffp, work-)>redOpA,rg, 0, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =work->c=onnIndex , work-0>connI ndex); ?| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h ncclShmem.c:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ru nRing(ti d, nth reads, work) ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hp:432:r78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here i432 | m if s(tid <( sutid, nthreads, &ring->prev, &ring->next, work->sendbbtnu) RunWforkCofll(-).run>recvbuff, work-(tid, subtn, work); | >redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, 1, 2, 2>::run' requested here o7 | D,EFIN E_ncCclDeOvFuncL(RedLuc_e_RIUNG_SNIMPLRE_PrOod_uL32_L2, nc>clFu(nctReduice, dFunc,Prod , uinnt32t_t, hNCreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCoCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uoll3, t2y, _red4o,p,n alcgo,c prlotoF, uunrnollc>()R.ruen()d; \u | c ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.he:,670: 15: note: Ffield 'nthreads' will be initialized after field 'tidInBlock' u 670 | n c tiPd(trid)o, ndthr,ead s(nuthrieadns),t ti3dIn2Blo_ck(tthr,ead IdNx.xC), CgroLup(g_rouAp),L | G ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hO:670_:60: Rnote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().rthrueadns((nth)read;s), ti\dIn Blo ck(| thr ^ea d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIdx.x:), 670gro:up(15gro:up) , note: | ^~~~~~~~~~~field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, wo:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nrk->csendbufcf, workl->recSvbuff, hwork->remdOpArg,e 0, wormk->conn.Index, cwork->oconnIndmex); | ^m /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here .63 | b runRingu(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpnote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here A432 | r ifg (tid, < su btn) R0unWor,kColl _UNROcLL>().orun(tnid,n subtIn, wornk); d| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cppe:12:1:x note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncR, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here e duce, 432Func | Prod , ui nt32 _t, NCCL_ ALGO _RINiG, NfCCL_ PR(OTO_StIMPLiE, 4d) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: , alguo, nprotWorkColl().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tioto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(d(tRid), enthrdeadsu(nthcreades)_, tiRdInBIlockN(thrGeadI_dx.xS), gIroupMPLE_Pro(grdoup_), u | ^~~~~~~~~~~~~~~~~ 3/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:2670:60:_ note: field 'group' will be initialized after field 'stepSize'4 670, | tnid(ctidc)lFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPL, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' ata1, 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group();In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().]run(ti/d, subtNn, worCk); | ^C /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7L:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here_STEPS/sizeof(T) : stepSiz 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, Ne_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: CCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thrin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sueadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ btn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PRIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611OTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: PLE, initializer order does not match the declaration order [-Wreorder-ctor]2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611: 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | 670RunWor | kBatch , altgo, prioto, udnroll>(()tid), nthreads(nthreads), tidInBlock(.run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nIn file included from t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, whreads(nthreads), tidInBlock(threadIdx.x), ork->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ grouthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | p (group ), | p ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hr:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIIn file included from dx.x/W/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from A/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173R: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: Punused variable 'w' [-Wunused-variable] 75 | _ barrieSr_bIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:y_group173(); | ^~~~~~~~~~~~~~~~~~ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hexpanded from macro 'barrier_by_group':75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 29 | c onst barrier_by_group(); | int ^~~~~~~~~~~~~~~~~~ w = thre adIdx.x//builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hWARP_SIZE:29:15: note: expanded from macro 'barrier_by_group' 29 | const ; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hIn file included from :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h::11: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75unused variable 'w' [-Wunused-variable]:7: warning: unused variable 'w' [-Wunused-variable] 75 | 75 | b barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from con/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hst int w: =11: In file included from threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hdx.x/WARP:_SIZE; \ 174 | ^ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: :14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, da:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, datata22, fl,ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hf:145:35: lwarning: ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | unused variable 'flag2' [-Wunused-variable] 145 | uuint32_it dnt32_t datata1a, fla1g1, d,ata2, flaflag1, g2;d | ^~~~~a ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | ba/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpprrier_by_gr:oup(); | ^~~~~~~~~~~~~~~~~~2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: 29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h :c75:7: onst int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),IMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn,work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tNEidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFIIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(NE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroltid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, uint32_t y, head,m antissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.hm:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77a:18: warning: unused variable 'y' [-Wunused-variable] n 77 | t uiinssa; t| 32_t ^ y, h ead, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] dx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75: 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from Idx.x//builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:W2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hA:11: In file included from R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: P/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:_14: warning: Sunused variable 'data1' [-Wunused-variable] 145I | Z uinEt32_;t \ | ^ In file included from data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bar | ^ rier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from st int w = threadIdx.x/WARP_SIZE;/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:\11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h | ^ :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:35: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2warning: : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:unused variable 'flag2' [-Wunused-variable]11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h :145:14: warning: unused variable 'data1' [-Wunused-variable] 145145 | | uint 32_t dat a1, f lagu1, daita2,n flatg2; 3 | ^~~~~2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] _145t data1, flag1, data2, | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h flag2; | ^~~~~ :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: cIn file included from ons/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.ht i:nt w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7x); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) Run:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadsOTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRi), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCC/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, u:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tidnro)ll>().,run(); \ | ^ n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:t670:15: hreads(nthreanote: field 'nthreads' will be initialized after field 'tidInBlock'd 670 | s ti)d(ti, tidInBlock(threadd), nIthreadds(nthrxeads).x), , tidgroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[InBlNock(thCreadIdCx.x), Lgroup(g_roup), P | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hR:670:O60: note: field 'group' will be initialized after field 'stepSize' T 670 | O tid(_tid)S, nthrIeads(nMthreadPs), tiLdInBloEck(t]/NCCL_STEPShrea/dsizeof(T) : stepSIdix.x), zgroup(egroup)_, | ) ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, protIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hI:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] E, 4) | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;In file included from \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group':2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx. x29 | co/nst intW w = thAreadIdx.Rx/WARPP_SIZE; _\ | ^ SIn file included from IZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, da/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cppt:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:a11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:1752: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80,:5: warning: unused variable 'w' [-Wunused-variable] 80 | f balrrier_aby_grogu2; p();| | ^~~~~~~~~~~~~~~~~~ ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: iIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hn:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80t:5: warning: unused variable 'w' [-Wunused-variable] 380 | 2b_at data1, flag1, rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const inIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->pIn file included from rev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ coll, ty, redop, algo, proto, alg,o, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid( unroltid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2In file included from : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90610 warnings generated when compiling for gfx908. . 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uinIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ t32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp::14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: 1: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12unused variable 'y' [-Wunused-variable] 77 | u: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:n77:18:t32_t y, head, warning: unused variable 'y' [-Wunused-variable] 77 | m uiant32_t nty, head, mantissa; | ^ issa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from 10 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from barri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2e: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:r11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h, data2, flag:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:= threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp: data1, 2f: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 10 warnings generateds when compiling for host. tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60In file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:lShmem.co173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hto:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, un, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tirdoll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring-In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: >prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex)In file included from ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cppkCol:l, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grL>o().rup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm | ^ ./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buffSizes[NCCL_PROTOun_(tidS, suIbtn, MworkP); LE]/NCC| ^ L/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp_:7:1S: note: Tin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here E7 | PDEFSINE/_ncsclDeivFuznc(eRedoucefSca(tterT_RI)NG_S IM:PL E_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :611:62: 34note: expanded from macro 'DEFINE_ncclDevFunc' | 611 | Ru nWo rkB atch ,e ads, &ring->algpo, prroteo, unrvol, &ring->next, work->sendbuff,l >(w).orunr(); k\ -| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15>: note: recvbuff, worfield 'nthreads' will be initialized after field 'tidInBlock' k -670 | > r teidd(tOidp),A nrthgre,ad s(0nt,hr eawdso), rtikdI-nB>loccko(nnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRingnt(hrteaidsd),, t idnInBloctk(htrhreeadaIddxs, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: .note: x)in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here, gr oup(432gr | ou p) , | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: OLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sufield 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ btn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dataIn file included from 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp;:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75: 7: warning: unused variable 'w' [-Wunused-variable] | 75 | ^~~~~barrier_ by_grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145p(); | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: uint32_t da29:ta1, 15: note: expanded from macro 'barrier_by_group' f29 | conslag1, dat intt w =a thre2adIdx, flag2; | ^~~~~ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here P_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); 34 | prims(tid, nthreads, &ring->prev, &rin | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ g->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 432 670 | : tid(t78id), nt:hread note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColls(nthr().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4 , step Size(nstepSicze_ =c= 0 ? ncclShmeml.comm.FbuffSuizes[nNCCL_cPROTO_RSIMPLeE]/dNCCL_uSTEPSc/sizeeof(TS) : sctepSiaze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.ht:34:7: enote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here r34 | , pri msFuncMinMax, doubl(tid, nthreads, &ring->prev, &ring->next, e, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrwoork-l>senldb>uff,( wor)k->recvbuff., worrk->urednOpAr(g, 0), wo;rk ->con\nInd ex, work| ->c ^onnI ndex/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65::5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here670 65: | 15 run:Rin note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreag(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncRIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~educeScatter, FuncMinMax, double, NCCL_ALG O | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid:670:15 : warning: ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncRedTEPSu/sizceof(eT) :S stecpSizea_) {t | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t | group(groupe /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hr:34:7,: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34F | u pnrimsc(tidM, ntihreands, M&riang->pxrev,, &ri ng->dnexto, wourk->bsendlbufef, w,ork- >rNCCL_ALGO_RING, NCCL_PROecvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | ruTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), n 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611thread10s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMtPid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE]/NCCL_STEPS/sizeof(T) : stepSiz warnings generated when compiling for hoste. _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_E_MSinMaxT_f64E_2,P nccSlFun/cResduceiScatzter,e FunocMinfMax,( dToubl)e, NC CL:_ALG O_RIsNG, tNCCLe_PROpTO_SSIMPize_) { LE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll> | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | ().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nruntRinghr(tied, nathrdeads), tidInBlock(threadIdx.xs, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn)), group(group), | ^~~~~~~~~~~ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFI:NE_n670cclD:evFu15nc(R:educ eScawarning: tterinitializer order does not match the declaration order [-Wreorder-ctor]_RIN G_SI MPLE_MinMa670x_f6 | 4_2, ncc lFun cRedu ceSctatteir, FduncM(inMatx, diod), nthreads(nthreads), tidInBlock(threadIdx.xuble, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthread^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hs:611:,62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:&ring->prev, &ring->next, work->sendbuff, 60: note: wfield 'group' will be initialized after field 'stepSize' o670 | r tk->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndeidx(ti)d), ;nthr ead s(nt| hrea ^ds), ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hdInBlock(thr:eadI65dx.x:), g5roup:( note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreadgroup), | ^~~~~~~~~~~ s, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ^ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h: flag111, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dat10a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thr warning: unused variable 'data2' [-Wunused-variable] warning 145 | se generated when compiling for gfx1101. adIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h::11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h174:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h: :75/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:7: warning: :unused variable 'w' [-Wunused-variable] 7575 | :7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | b baarrierr_by_rgroupier(_)b;y _g | ^~~~~~~~~~~~~~~~~~rou /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:p29(:)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: 15: note: expanded from macro 'barrier_by_group' 29 | const int w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | baexpanded from macro 'barrier_by_group' r29 | r iconset inrt w_ = thbreyadIdx_.xg/roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:WARP15_SIZE:; \ | ^ note: expanded from macro 'barrier_by_group' = t hreadIdx.29x/ | WARP_SI ZE; \ | const ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from ZE;/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h\:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75| | barrier ^ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from 2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11_: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174t: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145 :14: warning: dunused variable 'data1' [-Wunused-variable] ata1, 145 | f ulint32_tag1, data2, flag2; | ^~~~~ data 1, fl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hag1,: data1452, f:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flaglag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h1:145,:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | d uaint32t_t daata12, fl,ag1, datfala2,g 2f;l a g| 2 ^~~~~; /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h | : ^~~~~145 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:35::145 :warning: 28:unused variable 'flag2' [-Wunused-variable] warning: 145 | uunused variable 'data2' [-Wunused-variable] i 145n | uint32t32_t data1, f_tl daata1g, fla1,g1, data2, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(:2: )In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11;: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80| : ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h5: :warning: 29:unused variable 'w' [-Wunused-variable]15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | 80 | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ GO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Index); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeoIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBflock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connInde11 warnings generated when compiling for gfx90a. x); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nnIndex); | ^11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18:In file included from warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from dIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barri/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h: 80:5: bwarning: unused variable 'w' [-Wunused-variable] 80 | a r barrier_by_rgrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cooup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, workpS-ize(s>tepSize_r == 0 ? ncclShmecvbuff, work->redOpAem.cromm.buffSgizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.heads(nthreads), tidInBlock:670:15: warning: (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx906. 1010 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r o:u ps)t,e p S| i ^~~~~~~~~~~~~~~~~z e_)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :{670 : 60:| note: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~field 'group' will be initialized after field 'stepSize' | group(group 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h(t:h34r:e7a:d Inote: dxin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here. x), group(group), 34| | ^~~~~~~~~~~ prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 d?) ,nc cnltShhrmeeamd.sc(onmtmh.rbeuafdfsS)i,z etsi[dNICnCBLl_oPcRkO(TtOh_rSeIaMdPILdEx]./xN)C,C Lg_rSoTuEpP(Sg/rsoiuzpe)o,f ( T| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ : s| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_e pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 671 | | group(group stepSize(stepSize_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h=:=34 :07 :? nnote: cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested herec lShmem.comm. b34u | f f S i z e sp[rNiCmCsL(_tPiRdO,T On_tShIrMePaLdEs],/ N&CrCiLn_gS-T>EpPrSe/vs,i z&eroifn(gT-)> n:e xstt,e pwSoirzke-_>)s e{n d b| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f f ,| group(groupw ork->recvbuff, work->redOpArg,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :034,: 7w:o rnote: kin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here- >connIndex, wo r34k | - > c o n nIpnrdiemxs)(;t i d| , ^ nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hd:s65,: 5&:r inote: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hereg ->prev, 65& | r i n g -r>unneRxitn,g dsOepn,d bPurfoft,o ,w oCrOkL-L>_rUeNcRvObLuLf>f(,t iwdo,r kn-t>hrreedaOdpsA,r gw,o r0k,) ;w o r| k ^- >connIndex,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :w432o:r78k:- >note: cin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereo nnIndex); 432 | | ^ if/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :(t65i:d5 :< note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested heres ubtn) R u65n | W o r k CorlulnN(RtOiLdL,> (n)t.hrruena(dtsi,d ,w osrukb)t;n , | w ^o rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here:12: 1:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hnthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbu).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, workIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->nexint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, t, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75In file included from :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from ; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(In file included from th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.headIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: 173In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrneta dwI d=x t.hxr/eWadAIdRxP._x/SWIAZRPE_;S I\ZE ; | \ ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2 : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ a 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreadup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here s, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 11 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | SIMP L E _wParrepMIunlBSluomc_kf(6t4h_r4e,a dnIcdcxl.Fxu/nWcARRePd_uScIeZSEc)a,t t e| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, F| u warp(tid/WARP_SIZEn cPreMul S498u | m , d ofulbalgeT,h rNeCaCdL(_(AtLiGdO%_4R)I=N=G3,) ,N CgCrLo_uPpR(OgTrOo_uSpI)M,P L E| , ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ 4 )| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 | ^ 499 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h : 611s:t62e:p Snote: iexpanded from macro 'DEFINE_ncclDevFunc'z e(ncclS h611m | e m . c oRmumn.WbourfkfBSaitzcehs<[cNoClClL,_ PtRyO,T Or_eLdLo1p2<8t]y/>N,C CaLl_gSoT,E PpSr/ostioz,e oufn(ruoilnlt>6(4)_.tr)u)n ({) ; | \ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :67034 | : 7 : note: tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested herei d(tid), nthr e34a | d s ( n t h rperaidmss)(,t itdi,d InntBhlroecakd(st,h r&eraidnIgd-x>.pxr)e,v ,g r&oruipn(gg-r>onuepx)t,, w| o ^~~~~~~~~~~~~~~~~r k->s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.he:n670d:b60u:f fnote: ,field 'group' will be initialized after field 'stepSize' work->r e670c | v b u f ft,i dw(otrikd-)>,r endtOhprAeragd,s (0n,t hwroerakd-s>)c,o ntniIdnIdneBxl,o cwko(rtkh-r>ecaodnIndIxn.dxe)x,) ;g r o| u ^p (group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34: tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2:670:15: : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611u:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp 10 warnings generated when compiling for gfx1101. 1010 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14In file included from : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from nt32_t data1, fIn file included from l/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here ) 7 | DEFI,NE_nccl DevFuncw(ReduceiScatterd_RING(_SIMPLE_tPreMulSium_f8_2d, ncclF%uncReducWeSAcatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groupPROTO(_LL128]g/NCCL_SroTEPS/suizeof(upint64_)t)) { ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :34: 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, worktid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().ruIn file included from n(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &rin/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); g->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hexpanded from macro 'barrier_by_group':11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | 29 bar | rier_by _group( con); | ^~~~~~~~~~~~~~~~~~st /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h int w = threa:29:15: note: dexpanded from macro 'barrier_by_group' 29 | I dx.x/WARP _ const SiIZE; nt\ w = | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const intt w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum,In file included from uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:r2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11e: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:d warning: initializer order does not match the declaration order [-Wreorder-ctor] s670 | ( tid(ntid), ntthreadhs(nthrerads), etidInBalock(thdreadIdsx.x), g)roup(g,roup), tidInBlock | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_( 671 | threadIdx ste.pSix), group(grouzep(stepS)ize_ ==, | 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SI ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouMPLE]p/NCCL_STEPS/sizeof(T(group), | ^~~~~~~~~~~ ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pro:t670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]o , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group | (stepSgize(srtepSioze_ =u= 0 ?p ncclShmem.comm).bu,ffSi zes [NC| CL_PR ^~~~~~~~~~~OTO_ SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulIn file included from Sum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: RIn file included from ING,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h NCC:L_PR173OTO_: SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hPLE,: 4) 670 | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h::611:1562: note: :expanded from macro 'DEFINE_ncclDevFunc' 611 | warning: Ruinitializer order does not match the declaration order [-Wreorder-ctor]nWor kBat ch<670 | coll, ty, redop, algo, proto, unroll>().run tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? n()c; c\ l| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hS:670:h15: mnote: field 'nthreads' will be initialized after field 'tidInBlock' e670 | m ti.d(ticd), onthrmeadsm(nt.hreadbs), utidIfnBlofck(tShreaidIdxz.x), egrousp(gr[oup)N, | C ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hC:670:60:L _PROTO_SIMPLE]/NCCL_STEPS/sizeonote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gf(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redrouOp), p | ^~~~~~~~~~~A rg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(td), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work-In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h>connInde:x670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h10 warnings generated when compiling for host. :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/NCCL_STEPS/size:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tidof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->c),o nnthreadsn(nthreaIds), tnidInBlodck(threaedIdx.x)x, group,( work->connIndex); group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here o34 | , p rims(Ptid,r nthroeads, t&ringo->pre,v, &r ing->CnextOLL_UNROLL>(), w.ork-r>senudbufnf, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: (tidin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here, su btn, work); | ^ 65/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, | runFRingu(tlid, Snthrueadsm, wo,rk); | ^u /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hint32_t, NCCL_ALGO_RING, NCCL_PROTO_SI:432:M78: note: Pin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, 611 | s u RubnWotrkBantch,< cwollo, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | rk ); | ^t /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cppid:12:(1: tnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here i 12d | DE)FINE_n,ccl DevnFuntc(RhedurceScatteer_aRINdG_SsIMP(LE_nthreadsP)reM,ulS um_tuidInBlock(th32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | In file included from ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1201. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t In file included from In file included from data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connInIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nth/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hr:670:60: enote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | w = threadIdx.x/WAR P_SIZE; \ | ^ const int w = In file included from threadIdx.x/WARP_SIZE; \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2 | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hIn file included from :173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h warning: unused variable 'w' [-Wunused-variable] 75 | : barr11ier_by_g: roup(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h 29 | c:onst int 174w = : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]thr eadIdx. 145 | uint32_t data1, flag1, data2, flag2; | x/WARP_SIZE; \ | ^ ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:In file included from warning: unused variable 'flag2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->cIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable]onnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidIn file included from (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(titer, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pIn file included from roto, unr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11oll>().run(); \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | iIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), f (tid < subtn) RunWorkColl().trun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl() .run(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hid, subtn, w:ork); 29| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp::7:151: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here :7 | DEFI NE_ncclnote: DevFunexpanded from macro 'barrier_by_group'c(ReduceScatter _RING_ SIMPLE_Prod_bf298_2, nc | clFuncR educe Scatter , Func Prconod, rccl_st int w = threadIdx.x/WARP_SIZE; \ | ^ bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'In file included from 611 | RunWorkBatch, algo, proto, unroll>().ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from n(); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h\ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:nth21reads:), ti dInBwarning: lockunused variable 'flag1' [-Wunused-variable](threa dIdx .x), group(g145roup | ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:60 : note: field 'group' will be initialized after field 'stepSize' uint32_670 | t ti data1, flag1, data2, flagd2(ti;d), nth reads| (nthr ^~~~~eads), t /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uintidInB3lock2(thre_adItdx.x) , grdata1, foup(lgroupa)g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35, | ^~~~~~~~~~~ : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthIn file included from r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufIn file included from fSizes[N/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:C11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hCL_PROTO_SIMPLE]/NCCL_STE:174P: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: Swarning: unused variable 'data1' [-Wunused-variable] 145/ | uinst32izeof(T) : stepSize_) { _t | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h data1, f:lag1, da34ta2, fla:g2;7 | ^~~~~ : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] prims(t 145 | i uint3d, nthreads, &ring->2_t data1, flag1, daprev, &ring->next, work->sendbuff, work->recvbufta2f, flag2;, | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h :145:28: wwarning: unused variable 'data2' [-Wunused-variable] 145 | o uint32r_t data1k, flag->re1, ddOpArg, 0ata2, fl,ag work->con2; n| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:I145ndex:,35: work->connIn warning: unused variable 'flag2' [-Wunused-variable] 145 | d ex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65: uint32_5t data1,: fla note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here g1, da65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run( subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, u); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'MPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthr e 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34In file included from | prims(tid, nthreads, &ring->prev, &ring-/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | >next, work->sen: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ dbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 barrie | r_by _gr oup();r | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hu:29:15: note: expanded from macro 'barrier_by_group' n 29 | cRonst init w = tnhrg(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); data1 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt, hwork->senrdbuff, woerk->recavbuff, wdork->redOpArg, 0, work->csonnIndex(, work->nconnIndext); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hh:65:5:r note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | e runRinag(tid, nthretidInBlock(threadIdx.xads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | pri: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herem 432 | s i(f (tid t< subtin) RunWdorkCo,ll().rudn(tisd, subt,n, wo rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f1&ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h6_2,: nccl65FuncR:educe5ScattIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RINGe:r, F uncPnote: rod, in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested herehalf, NC CL_ALGO_R65ING, N | CCL_ PROTO _SI MPLE, 2) | ^r /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611u:62: note: nexpanded from macro 'DEFINE_ncclDevFunc' 611R | iRunWonrkBatgch, aRledOp, Proto, COLL_UNROLL>(tid, nthreads,go, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Alnote: gfield 'nthreads' will be initialized after field 'tidInBlock' o670 | , tiPd(trid)o, ntthroead,s( nthCreaOds)L, tLidI_nBlUocNk(tRhreaOdIdLx.xL), >gro(up()gr.oupr), u | ^~~~~~~~~~~~~~~~~ n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:(670:60t: inote: field 'group' will be initialized after field 'stepSize' d 670, | s tiud(tbid),tn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SI nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFuncIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11(ReduceScatter_RING_SIMPLE_Prod_f16_4, n: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, In file included from &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, workidI)nBlock;(thr eadIdx .x), g| roup(g ^roup), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_S (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_IMPLPE]/NCCLr_STEoPS/sizdeof(T) _: stepfSize_)1 { | 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group_ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:434:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here, 34 | prnims(ticd, nthcreads,l &ring-F>prev,u &ringn->nextc, woReduceScrk->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex,atter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtSnizes[)NCCL_ PRORTO_SIuMPLE]n/NCCL_WSTEPSo/sirzeof(kT) : CstepSoize_)l { | l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group , FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here ,34 | prTims(t, RedOp, Algo, Protid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, o, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatt0, woerk->cronn,Index, woFrk->counnInndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hc:65:5: Pnote: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65r | o rundRing<,T, Red Op,half, NCCL_ALGO_RING, NCCL_PROTO_S Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Size(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp::2173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75: :In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: warning: unused variable 'w' [-Wunused-variable] 75 | barri:er_by_g29roup(:)15: note: expanded from macro 'barrier_by_group' 29 | cons; | ^~~~~~~~~~~~~~~~~~ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | i nt w const int w = threadIdx.x/WARP_SIZE; = threadIdx.x/WARP_SIZE; \ | ^ \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from 75/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h::175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:807:5: warning: unused variable 'w' [-Wunused-variable]: 80 | barrier_ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | by_gr oup()c; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.ho:29:15:n note: expanded from macro 'barrier_by_group' s 29 | t const int w i= threnadIdx.tx/WAR P_SIZE;w = th\ | r ^ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp); | : ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:229:15: : note: expanded from macro 'barrier_by_group' In file included from 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h cons:t in11t w =: threIn file included from adIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h.x/WA:RP_SIZ173E; \: | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hduceScatter, F:u670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd,ncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg,ff 0, woSrk->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff[ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o , work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FunIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here cProd, floa 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498IM:PL29E]/:NCC L_Swarning: TEPfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]S/s iz eof(T) :496 st | epSi ze_ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| group(group i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hd:34:7(: note: tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here i34 | d ) ,pr imsn(titd, hnthrreaeds,a &rdings->p(revn, &trinhg->rnexet, awordks), wid(tid%WARP_SIZE), warp(t->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex,id/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group( gworrko-u>cpon)nI,nd ex); | | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:| 65: warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==35: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 49965 | | ru nR insg(etmid., cntohrmeamds., bworuk)f; f | S ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hiz:432:e78:s note: [Nin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here C C432 | L _ P R iOf T(tOid_ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f32_2, ncclFuncReduceSIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)catter, FuncProd, float, NCCL_, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tid if (tiInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/d < subtnsizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid <) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7lgo, Proto, COLL_UNROLL>:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unro().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunW/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkColl().run(tid, subtn, work), 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, alg/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ho, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre:670:a15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ds), tidInBlock(threadIdx.x 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from uint32_t data/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp1:,2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h flag1,: data2, 80fl:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrag2;i | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:e145:21: warning: unused variable 'flag1' [-Wunused-variable]r_ 145 | uint32_t data1, flag1, data2, flby_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti if d(tid <( subtntid), nthre) aRunWorkdColl()a.run(tdid, ssubtn, w)ork); , | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp :tidInBlock(threadIdx.x), group(group5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f64_2, ncclFunc), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]Red/uceScNCCL_STEPatSter, F/uncPsrod, dioublez, NCCLe_ALGO_RoING, NfCCL_P(ROTO_LTL128, )2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | :611 :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | p RunrWorkBaitch, aid, nthreads, &lgo, proto, unroll>().run(); \ | ^ ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :670:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hwarning: :34initializer order does not match the declaration order [-Wreorder-ctor]: 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | p670r | i m s ( titdi,d (nttihdr)e,ad sn,t h&rreiandgs-(>nptrherve,a d&sr)i,n gt-i>dnIenxBtl,o cwko(rtkh-r>esaednIddbxu.fxf),, wgorroku-p>(rgercovubpuf)f,, w| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r k -| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_r edOpArg, 0 ,671 | w o r k -s>tceopnSniIzned(esxt,e wpoSrikz-e>_c o=n=n I0n d?e xn)c;c l S| h ^m em.co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hm:m65.:b5u:f fnote: Sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested herei zes[NC C65L | _ P R O TrOu_nSRIiMnPgLp(tSiidz,e _n)t h{r e a| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s ,| group(groupw ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :note: 34in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here: 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 432 | i f34 | (t i d < psruibtmns)( tRiudn,W onrtkhCroelaldOpp,r eAvl,g o&,r iPnrgo-t>on,e xCtO,L Lw_oUrNkR-O>LsLe>n(d)b.urfufn,( twiodr,k -s>urbetcnv,b uwfofr,k )w;o r k| - ^> redOpA/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cppr:g7,: 1:0 ,note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herew ork->connInd e7x | ,DE FwoIrNEk_-n>cccolnDneIvnFduenxc()R;e d u| ^c eScatte/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hr:_65R:I5N:G _note: Sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hereI MPLE_Pr o65d | _ f 8 _ 2r,u nnRcicnlgFc(lt_ifdl,o antt8h,r eNaCdCsL,_ AwLoGrOk_)R;I N| G ^, NCCL_PRO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hT:O432_:S78I:M Pnote: Lin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereE , 2) | ^ 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h : 611 :i62f: (note: texpanded from macro 'DEFINE_ncclDevFunc' id < su b611t | n ) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from In file included from 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:| 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre tiad(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from 14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRIn file included from OTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gorrk->reodOpArgu, 0, worpk->conn)Index, ,work->c o | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | snnItndex); e | ^ p/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:Size(stepSize_ == 0 ? ncclShmem65:.5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRingc(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | | if (t id < subptn) RunrWorkColilprevg, &ro, Piroto, CnOLL_gUNROLL->().r>next, work->sendbuff, workun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hPLE_Prod_u64_4, :ncclFu432n:78:cReduceScatter, FuncProd, uint64_t, NCCL_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, p note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RINGroto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), n_StIMPLhE_Prrode_u64a_2, dncclsFunc(ReducneScattterh, FurncPreod, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFun:670c:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_S(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuedop, algo,f prot,o, unrol l>().ruwn(); \ o | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:r670:15: note: kfield 'nthreads' will be initialized after field 'tidInBlock' 670 | - tid(>tid), ntrhreads(enthreadsd), tidOInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), pArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:nth432read:s(nt78hrea:ds), tidInote: nBloin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereck(t hr eadIdx.x432), | grou p(gro up), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIstepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, wdx.x), group(group), | ^~~~~~~~~~~ ork->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_Sy>, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCLIn file included from _ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hi:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: (warning: initializer order does not match the declaration order [-Wreorder-ctor] 670t | tidi(tid), dnthread)s(nthr, nthreads(nthreads), tidInBlock(threades), tidInaBlock(thdreadIdxI.x), gdroup(grxou.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufIn file included from fSizes[NCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h_:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173P: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15R: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),O nthreadTs(nthreOads),_ tidInBSlock(thIreadIdxM.PLx), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work-> tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_s e671 | n stepdSize(bstepSuize_ f== 0 f? ncc,lShmem .comwm.buoffSizres[NCkCL_PR-OTO_SI>MPLE]r/NCCLe_STEPSc/sizevof(T)b : stuepSizfe_) {f | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ , | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h work->redOpArg, 0, work->connIndex, wo:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &rrk->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollnext, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: :78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndext, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatc)h; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connInIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: In file included from warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx90a. :14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScIn file included from atter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(t tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_Sd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEIn file included from PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: , nthreads(nthin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->reads)prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | , tidInBlocrunRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here k(threadIdx.x), group( 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here group), 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ stepSize(s/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 1010 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fla:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSizIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, worIn file included from k->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp: 2: (In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]i 670 | d ti d(ti().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, Funcomm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, Sum, halwf, NCoCL_ALGrO_RINGk, NCCL-_PROT>O_SIMsPLE,e 4) n | d^bu /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hf:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrof, lwork-l>recvbuff, work->red>OpA(rg, 0, )work->.connIndrex, wourn(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' k ->connIn670dex); | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), ti:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hered I432 | n B lif o(tidc < skubtn() RuntWorkhCollr, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFto, xCOL.L_UxNRuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~ OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadsIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, wo), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tid/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(steInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grou29p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadId:x.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, d aconstt int wa = 2threadIdx,.x/ WAfRPl_SIaZEg; \ 2| ^ ; In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp: 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hinitializer order does not match the declaration order [-Wreorder-ctor]:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre670ads(n | thre ads), t i dtid(tid), nthreInBlock(threaadIdxd.x),s g(rnotuhpr(egarodus)p),, tidInBlock(threadIdx.x), group(gr | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[Noup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buCCL_fPROTOf_SIMPLE]/NCCL_STESPS/sizeizes[NCCL_PRof(OT) T: steOpSize__SIMPLE]/NCCL_STEPS) /{ | s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupi /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34z:e7o: fnote: (in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereT ) : s34t | e pS i z e _)p ri{m s(| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group tid, n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hthreads, &:34:7ri:ng -note: >pin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here r 34 | ev, &ring->next, work->sendbuff, w prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->roerk->reccvbuvff, bwork->redOpArg,u 0, wfork->cofnnInd,ex, wo rk->cownonrIkn-d>redOpArg, 0, work->connex)I; | ^n /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hd:65:e5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | rux, work->connIndneRingx(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, wo:r432:78: knote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here) ; 432 | | ^ i/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hf (ti:d < 432sub:tn78) R:u nWonote: rin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here kC o432l | l < Fn , T, ReifdO (tp, iAld g().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Red_uUcNReOLLS>c()a.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: ttin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested heree r_ RING_7SI | MPDLEE_SFumI_fN3E_ncclDevFunc(ReduceScatter_RING_SIM2_P2,L nEcc_lFSunucRmeduce_Scfat3t2er_, 2Fu,nc Suncm,c flloFatun, cNCReCL_AdLuGOc_ReISNGc, aNCtCLt_PeROrTO,_ FuncSum, float, NCCL_ALGO_RISNGIM,PLE , 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, a /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hlgo:611,:62 : note: expanded from macro 'DEFINE_ncclDevFunc'pro t 611 | RunWorkBatch, algo, op, urnroollt>(o).,ru n(u);n \r o| ^ll> /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h():670:15:. note: field 'nthreads' will be initialized after field 'tidInBlock' r670 | u n t(id)(t;id), n\t hr ea| ds ^(n th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hreads), :ti670dI:nB15:lo cknote: (tfield 'nthreads' will be initialized after field 'tidInBlock'hr e 670 | ad I dx . tx)i,d( gtroid), nthreads(nthreup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tida),ds ),n ttidhIrnBelock(tharedadsIdx.x), group(grou(pn)th,r e a| ^~~~~~~~~~~~~~~~~d /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(st),i dti)dI,nB lnockt(thhrreeadIadxd.xs),( gnrotuph(grroeuadp),s )| ^~~~~~~~~~~, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBloFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_SIn file included from TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, wor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497k-> | c on nI nde x,w waorrk-p>IconnInndBex); l | oc ^ k(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.ht:65:h5:r note: ein instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here a d65 | I dx .ruxnRi/ngW(tid, nthreads, w /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncRork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:educeScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), grsuobtun)p R(ungWroorukpCo)ll,< Fn ,| T ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~, Re | d warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3O p , Al499go | , stProto, COLL_UepSize(ncclShmeNmRO.LLc>(o).mrmun.(tbidu, fsufbtSn,i wzorek)s; [ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cppNC:C12:L1:_ note: Pin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here R O12 | TDEO_FINLEL_n1c28cl]De/NCCL_STEPS/sizeof(uint64_t)) { vFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclF| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tiundcR,ed ucneStcahttrer,e FauncdSsum,, fl&rioang->prev, &ring->next, work->ts, eNCnCLd_AbLGuO_fRIfNG,, N CCwL_oPRrkOT-O_>SIrMPeLEc, v4) | b^u f/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hf:611,:62 : wnote: expanded from macro 'DEFINE_ncclDevFunc'o r611k | - > RrunWeordkBOaptchAconnIndex, work->connIndex)t;y, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670 : | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRin15:g note: field 'nthreads' will be initialized after field 'tidInBlock' (tid, nthreadds(t,id ),w nothrrekad)s(;nt h rea| ds ^), t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hidInBl:o432ck:(thre78a:dI note: dx.xin instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here) , group432( | gr oup ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :670i:60f: (note: tfield 'group' will be initialized after field 'stepSize' i670 | d ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ id), nthreads(nthreads), tidInBlock(thread[ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o Idx.x), group(group), | ^~~~~~~~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from :34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, wo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:r2: In file included from k/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11-: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h>:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:r670:e15:c vbwarning: initializer order does not match the declaration order [-Wreorder-ctor]uf f670 | tid(tid), nthreads(nthreads), tid, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: InBlocin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested herek(t hr eadIdx65.x) | , g rou p (gr oupr), u | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ in671g< | T , Re dOstp,ep PSirozet(os,t CepOSLiLz_UNeROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: _ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupnote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 7 34 | | D pErimFs(tIid,N ntEhre_adsn, &crincg->lpreDv, e&rivng-F>neuxt,n wocr(k->senRdbuff, eworkd->ruecvcbufef, work->redOpArg, 0, work->connIndex, work->connIndexScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread498:Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hn:34:7: note: Bin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | l porims(ctid, nkthreads,( &rintghreadIdx.x), group->pre(v, &ring->next, gworrokup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~->sendbuff, work->recvbuff, work->redOpArg, 0, work->connInde x| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ ,671 | w steopSirze(stepSize_k- == 0 >? nccclShomem.ncommn.buffSIindzees[xN); | ^ CCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h):79 :5:{ note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | | runRi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ng, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here OLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWor/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f3kCollprev, &ring->next, work->sendbuff, work->recvbuff, work->redOp2_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Arg, 0, work->connIndex, woro, Proto, COkLL_UNR-OLL>(>).run(tid,c soubtn, work); | n ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cppn:5I:1ndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h::65 note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here: 5 | 5DEFIN:E_ncc lDevFunc(ReduceScatter_RING_LL128_Sum_f32_2, ncclFuncReduceScatter, FuncSunote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gronote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrollup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from ba/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrrrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring-/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2>next, work->sendbuff, : wIn file included from o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hr:k11-: >In file included from r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.he:cv173b: uf/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hf,: 670w:o15r:k -warning: >rinitializer order does not match the declaration order [-Wreorder-ctor]e dOpArg, 0, work->connI n670d | e x , wtoirdk(-t>icdo)n,n Inntdherxe)a;d s (| n ^t hreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hre:a65d:I5d:x .note: xin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here) , group(g r65o | u p ) , r u| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R i n| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_< T, RedOp, P r671o | t o , CsOtLeLp_SUiNzReO(LsLt>e(ptSiidz,e n_t h=r=e a0d s?, nwcocrlkS)h;m e| m ^. comm.buffSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hz:e432s:[78N:C Cnote: Lin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here_ PROTO_SIM P432L | E ] / N C C Li_fS T(EtPiSd/ s(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h):.34r:u7n:( tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hered , subtn, work )34; | | ^ prims(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cppt:i7d:,1 :n tnote: hin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herer eads, &ring- >7p | rDeEvF,I N&Er_inncgc-l>DneevxFtu,nc (wRoerdku-c>esSecnadtbtuefrf_,R IwNoGr_kS-I>MrPeLcEv_Sbuumf_ff6,4 _w2o, rnkc-c>lFruendcORpeAdrugc,e S0c,a twtoerrk,- >FcuonncnSIunmd,e xd,o uwbloer,k -N>CcCoLn_nAILnGdOe_xR)I;N G ,| ^N CCL_PR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hO:T65O:_5S:I Mnote: Pin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested hereL E, 2) 65| ^ | run/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hR:i611n:62g:< Tnote: ,expanded from macro 'DEFINE_ncclDevFunc' RedOp, P611r | o t o , RCuOnLWLo_rUkNBRaOtLcLh><(ctoildl,, nttyh,r eraeddso,p ,k )a;l g o| , ^ proto, unr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ho:l432l:>78(:) .note: rin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereu n(); \ | ^ 432 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h : 670i:f15 :( tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd < subtn) 670R | u n W o rtkiCdo(ltlid(I)d.xr.uxn)(,t igdr,ou ps(ugbrtonu,p )w,o r k| ) ^~~~~~~~~~~~~~~~~; | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h ^: 670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: 670in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here | tid(tid) ,7 | nDtEhFrIeNaEd_sn(cnctlhDreevaFdusn)c,( RteidduIcneBSlcoactkt(etrh_rReIaNdGI_dSxI.MxP)L,E _gSruomu_pf(6g4r_ou2p,) ,n c c| l ^~~~~~~~~~~F uncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:In file included from 670/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou:15p), | ^~~~~~~~~~~ : note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, N, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(CCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, heaIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uind, manttissa; | ^ 32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZ/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 7510 | warnings generated when compiling for gfx1100. barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp :c2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:o11: In file included from n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:s174: t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h :75:i7: nwarning: unused variable 'w' [-Wunused-variable]t 75 | w b arri=er_by t_grohup()In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_S;I | Z ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hE:29:15;: note: expanded from macro 'barrier_by_group' \ 29 | con| st i ^nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp :1452 | : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h : 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174uint32_t: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: dwarning: ata1unused variable 'flag1' [-Wunused-variable], fl ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:282:, fl ag2;warning: | ^~~~~unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: 145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | 145 uin | t32_t da ta1, flaug1,i datna2,t f32_tlag 2;data1, fl | a ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hg:1451:35,: warning: unused variable 'flag2' [-Wunused-variable]d at145a | 2 , ufilantg322;_t d| a ^~~~~t a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h1,: f145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dalag1, datta2, afla1g2, flag1, data2, flag2; | ^~~~~ ; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grou/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:p2: In file included from (/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h):175: ;/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h: 80: 5: warning: | unused variable 'w' [-Wunused-variable] ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h80 | :ba29rr:ier15_by_:gr ounote: pexpanded from macro 'barrier_by_group'( )29; | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h :29c:15on: snote: texpanded from macro 'barrier_by_group' int29 | w = threadIconsdtx i.nt w x/= WtARhrePa_dSIIdZEx.;x /\ W AR| ^P_ SIZE; \ | ^ 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo]/cNCCkL_(STEPtS/shizeroef(Ta) : stdepSIizde_x) {. | x ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ )| group(group ,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34 :7:g note: rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here o34 | u prpims((tigd, rnthoreaudps, )&ri,ng- > p| rev, ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ &rin g| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ->next, work->sendbuff, work->recvbuff, work->rsetepdSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_OpAPrRg, O0, TworkO->c_onnSIndIex,M PworLk->cEonn]Ind/ex); | N ^ CCL_STE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:PS65:/s5:iz enote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here of 65 | ( T ) :ru nsRtepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:ing34, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereo, COL L_UNRO34LL | >( t pidri,m s(nttihrd, nthreaeds,a wodrk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here s, &r432ing | -> pr ev, &r ing ->next, work->sendbuff, work ->if (rectivd redOpAWorrkCgoll,COLcLo_nUnNRInOLdeL>x(,) .wrounr(kt-i>d,c onsunInbdex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | tn, wor k); r| ^In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ u /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cppn:7R:1:i note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested heren g7 | D(tid, nthreads, workf8_2, nc)cl;Fu nc Redu| ce ^Sc at/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hte:r432, F:78un: cSnote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereu m, 432 | if (tid < sub rtccnl_fl)oat 8, RNCCuL_nALG/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hW:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]O_RING, oNCrkCL_CoPRlOlrPROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce(k)B.runat(ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ opd, n,, wor k)a;l | ^g /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cppo:, 7:pr1:o note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested heret 7 | DEFINE_ncclDevFuo, ncun(rRolleduceScatter_RING_SI>M()P.ruLn()E; \_ | S ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hu:670m:15:_ note: field 'nthreads' will be initialized after field 'tidInBlock' f 8670_ | 2 ti,d( nticd)c,l nFuthnrceRaedds(ucenSctahtretaer, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_Sds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreaIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | d ^s(n th/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hreads):, 670tidInBl:ock15(th:re adnote: Idfield 'nthreads' will be initialized after field 'tidInBlock'x. x) , gr670ou | p (g ro up), t | ^~~~~~~~~~~i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROT/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hO_SIMP:670:15:LE]/NC CL_warning: STEinitializer order does not match the declaration order [-Wreorder-ctor]P S/ si670z | e o f (T )t i:d (stited)pSi,ze _n) t{hre a | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ds (nt | group(grouph r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &reiandsg), -ti>dpInBrlocev, &ring->next, work->sendbuff, work->rk(thecreavdIdbx.uxf), fgro,up w(grooupr)k,- | > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hT) : stepSize_): {65 | : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5| group(group : note: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here : 34:657 | : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here ru nRing(34 | t i d pr,ims (tind, nthretadsh, r&rieng-a>prdev,s &r,ing ->next, wwork-o>rsk)endb;uf f,| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hwork:->432r:78:ec note: vbin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereu ff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[Batch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thrNCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t daIn file included from ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_g 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stxepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | primsIn file included from (tid, nthreads/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, workSIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested heret, NCCL_ALGO_RING, NCCL_PR 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tiads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFdu/WARnP_SIZcE), | ( ~~~~~~~~~~~~~~~~~~ R | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) e 497 | dwarpuIcneScatter_RING_SIMPLE_Sum_u32_4, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ncBlock(threadcIdx.xl/WARFP_SIZuE), ncRe| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_ 498 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().ru flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 duceScatter,LL128_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchL12,8 ]/NCCaL_lSgToE,PS/si zpreoof(utinot64,_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, & unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(rtingh->prreve, &arindg->Inexdt, workx->s.enxdbuff, )wor,k-> recgvburffoup(group), | ^~~~~~~~~~~~~~~~~ , work/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h->redO:pAr670g, :060, w:or note: k-field 'group' will be initialized after field 'stepSize'> c 670 | tid(otnniInddex, )wor,k-> connnIntdexh)r; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:ea79ds(:nth5re:ad s)note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work-:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62:>connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthre/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hs):,670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbufeduceScatterf, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tid_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZEIn file included from ; \ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: | warning: unused variable 'w' [-Wunused-variable] 75 | ^ barri er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, dat/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t In file included from d/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ WARP_SIZE; \ | ^ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fl const int w = threadIdx.x/WARP_SIZE; \ | ^ag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ IZE; \ | ^ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145In file included from | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, In file included from work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h::15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h15:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:: 18: In file included from warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:77 | 18 : warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Index, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, ceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, & note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), ntring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtnCL_)STEPS/s izeof(RT) : stuepSizen_) { W| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:r7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here k34 | prims(tid, Cnthreados, &rilnglprev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_worSk-I>conMnIndePx); L | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hE:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65_ | Sum_u64In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173_4, r unRinng (tidN, nthrCeadCs,L_ALGO_ wRork)I; NG| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWork, NCCLC_PROToO_Sll, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(toi, Prodto, )COLL,_UNR OLL>n().rtun(tihd, srubtn, work); e | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cppa:12:d1: snote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here ( 12 | nDEFItNE_nhcclDrevFuenc(Raeducds), tidInBlock(threScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FunceSadIdux.xm), gr,oup( grouup), i | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hn:670:60t: note: field 'group' will be initialized after field 'stepSize' 670 | 6tid(4tid)_t, NCCL_ALGO_RING, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRingr(tioup),d, nth r | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | eads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepS ize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N if (tid < subtn) RunWorkColl().run(tid, subtn, wCCLo_STEPSr/sizeokf(T) : )stepS;ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34: ^7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | 34 | D E primsFINE_ncclDevFu(tidn, nthrceads,(In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ReduceScatter &_RINGring-_>prevS, &In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ riIMPng-L>E_Sum_u64_4, nccnexlt, woFrk->suendbnufcReduceSf, work->recvbuff, work->redOpArg, 0, work->catter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTOIn file included from _SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algcoonnIn,dex, work-p>connrIndexo)/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t; | ^o /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:,65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65u | nrunRirng().run(); \ rot o, C| OLL_ ^UNRO LL>(/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.htid, nth:read670s, wo:rk);15 | ^: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 432 :78t: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herei 432d | (t if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ro ^~~~~~~~~~~to, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlocIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hk(threadIdx.x), group(group), | ^~~~~~~~~~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barri, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: cIn file included from onst in/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.ht w = thr:eadIdx.11x/WARP_S: IZE; \ In file included from | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx./builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cppx/WAR:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] P_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAR/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cppP:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11_: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hS:75:7: warning: unused variable 'w' [-Wunused-variable]I 75 | Z baErrier_b; \ y _gro| up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_ ^S IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'In file included from 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(oup), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recv | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hea:670d:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] s670 | ) tid(ti,d), nth readst(nthreaids), tiddInBlocInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~k(th readIdx/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h.x), group(g:roup),670 | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | : tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 67160: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread | s ste(pSize(sntepSizet_ == h0 ? nreads), tidIncclShmem.comm.buffSizes[NCCL_PROBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCol10 warnings generated when compiling for host. l().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18In file included from : warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1200. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const intIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ orkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint 3772 | _ t y , h e audi,n tm3a2n_tti sys,a ;h e a| d ^, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, wo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),rk->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCLunused variable 'y' [-Wunused-variable]_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(ti/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hd, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_tIn file included from data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp: :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: :In file included from 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threIn file included from adI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cppd:x2.: xIn file included from )/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h,: 11g: rIn file included from o/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hu:p173(: gr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ho:u670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:15:34 :warning: 7initializer order does not match the declaration order [-Wreorder-ctor]: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | 670 | ptriimds((ttiidd),, nntthhrreads(nthreads), tidIenBaldosc,k &(rtihnrge-a>dpIrdexv.,x )&,r ignrgo-u>pn(egxrto,u pw)o,r k -| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s e n| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_b uff, work- >671r | e c v b usftfe,p Swiozrek(-s>treepdSOipzAer_g ,= =0 ,0 w?o rnkc-c>lcSohnmneImn.dceoxm,m .wbourfkf-S>iczoensn[INnCdCeLx_)P;R O T| O ^_ SIMPLE]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h/:N65C:C5:L _note: Sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hereT EPS/siz e65o | f ( T ) r:u nsRtienpgS(tid, nthreads,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h :w34o:r7k:) note: ;in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here | ^ 34 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h : 432 : 78 : pnote: rin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herei ms(tid, nt h432r | e a d s , i&fr i(ntgi-d> pWnoerxktC,o lwlo sTe,n dRbeudfOfp,, wAolrgko-,> rPercovtbou,f fC,O LwLo_rUkN-R>OrLeLd>O(p)A.rrgu,n (0t,i dw,o rsku-b>tcno,n nwIonrdke)x;, w| o ^r k->connInde/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cppx:)12;: 1 : | note: ^in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65 :125 | :D Enote: Fin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested hereI NE_nccl D65e | v F u n cr(uRneRdiuncge_(ut3i2d_,4 ,n tnhcrcelaFdusn,c Rweodrukc)e;S c a| t ^t er, FuncSu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hm:P432o:s78t:D inote: vin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here, uint32_t ,432 | N C C L _ A LiGfO _(RtIiNdG ,< NCsCuLb_tPnR)O TROu_nSWIoMrPkLCEo,l l4<)F n ,| ^T , RedOp,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :A611l:g62:o ,note: expanded from macro 'DEFINE_ncclDevFunc'P roto, CO L611L | _ U N RROuLnLW>o(r)k.Brautnc(ht| , ^ algo, pr/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cppo:t7o:,1 :u nnote: rin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereo ll>().run(); \7 | D| E ^F INE_ncclDev/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hF:u670n:c15(:R enote: field 'nthreads' will be initialized after field 'tidInBlock' duceS c670a | t t e r _tRiIdN(Gt_iSdI)M,P LnEt_hSruemaPdoss(tnDtihvr_eaud3s2)_,2 ,t indcIcnlBFluoncckR(etdhurceeaSdcIadtxt.exr),, FgurnoucpS(ugmrPoouspt)D,i v ,| ^~~~~~~~~~~~~~~~~u int/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h3:2670_:t60,: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tNid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nc32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NcClCLS_hSmTeEmP.Sco/msmi.zbeuofff(STi)z e:s [sNtCeCpLS_iPzReO_T)O _{S I M| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L E ]| / group(groupN CCL_STEPS/sizeof(T) : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hs:t34ep:S7i:z enote: _in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 34 | prims(tid, nthreads, &ring->prev, &r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hi:n34g:-7>:n enote: xtin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here, work->sendbuff ,34 | w o r k - > rpercivmbsu(ftfi,d ,w onrtkh-r>eraeddsO,p A&rrgi,n g0-,> pwroervk,- >&croinnngI-n>dneexx,t ,w owrokr-k>-c>osnennIdnbduefxf),; w o| r ^k ->recv/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hb:u65f:f5,: wnote: oin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested herer k->red O65p | Ar g , 0r,u nwRoirnkg-<>Tc,o nRneIdnOdpe,x ,P rwootrok,- >CcOoLnLn_IUnNdReOxL)L;> ( t| i ^d , nthreads, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hw:o65r:k5):; note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here| ^ 65 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h : 432r:u78n:R inote: nin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereg b(ttni)d ,R unntWhorrekaCdosl,l , 1, 2, 2>::run' requested hereL L_UNROLL>( )432. | r u n ( t i di,f (stuibdt n<, swuobrtkn)); R u| n ^W orkColl, 1, 2, 4>::run' requested hereO p, Algo, Pro t12o | ,D ECFOILNLE__UnNcRcOLlLD>e(v)F.urnucn((Rteiddu,c esSucbattnt,e rw_oRrIkN)G;_ S I| M ^P LE_SumPost/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cppD:i7v:_1u:3 2note: _in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here4 , ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NC7C | LD_EAFLIGNOE__RnIcNcGl,D eNvCFCuLn_cP(RROeTdOu_cSeISMcPaLtEt,e r4_)R I N| G^_ SIMPLE_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hS:u611m:P62o:s tnote: Dexpanded from macro 'DEFINE_ncclDevFunc'i v_u32_2 ,611 | n c c l FRuunncWRoerdkuBcaetScchav,, aulignot,3 2p_rto,t oN,C CuLn_rAoLlGlO>_(R)I.NrGu,n (N)C;C L\_ P R| O ^T O_SIMPLE/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h,: 6702:)15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :670611 | : 62 : note: texpanded from macro 'DEFINE_ncclDevFunc'i d(tid), n611t | h r e a dRsu(nnWtohrrkeBaadtsc)h,< ctoildlI,n Btlyo,c kr(etdhorped,x .axl)g,o ,g rporuopt(og,r ouunpr)o, l l| > ^~~~~~~~~~~~~~~~~( ).ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hn:(670):;60 :\ note: field 'group' will be initialized after field 'stepSize'| ^ 670 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h :t670i:d15(:t inote: dfield 'nthreads' will be initialized after field 'tidInBlock') , nthread s670( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp 11 warnings generated when compiling for gfx942. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPoProto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_stDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(throefa(dTI)d x:. xs)t,e pgSriozuep_()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hs:t34e:p7S:i znote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here( stepSize_ == 0 34? | n c c l S hpmreimm.sc(otmimd.,b unftfhSriezaedss[,N C&CrLi_nPgR-O>TpOr_eSvI,M P&LrEi]n/gN-C>CnLe_xStT,E PwSo/rski-z>esoefn(dTb)u f:f ,s tweoprSki-z>er_e)c v{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, w| o group(groupr k->redOpArg, 0/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h,: 34w:o7r:k -note: >in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested herec onnIndex, w o34r | k - > c o n npIrnidmesx()t;i d ,| ^n threa/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hd:s65,: 5&:r inote: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hereg ->prev ,65 | & r i n gr-u>nnReixntg,< Tw,o rRke-d>Ospe,n dPbruoftfo,, wCoOrLkL-_>UrNeRcOvLbLu>f(ft,i dw,o rnkt-h>rreeaddOsp,A rwgo,r k0),; w o| r ^k ->connInd/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.he:x432,: 78w:o rnote: kin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here- >connInd e432x | ) ; | ^ if (t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.hi:d65 :<5 :s unote: bin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested heret n) Run W65o | r k C o lrluR(OtLiLd>,( )n.trhurne(atdisd,, wsourbkt)n;, w| o ^r k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cppnote: :in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here12 :1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 432 | i f12 | (DtEiFdI Nc(c)l.Fruunnc(Rteiddu,c esSucbattnt,e rw,o rFku)n;c S u| m ^P ostDiv, u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cppi:n12t:614:_ tnote: ,in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here NCCL_ALGO_R I12N | GD,E FNICNCEL__nPcRcOlTDOe_vSFIuMnPcL(ER,e d4u)c e S| c^a tter_R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hI:N611G:_62S:I Mnote: Pexpanded from macro 'DEFINE_ncclDevFunc'L E_SumPo s611t | D i v _ uR6u4n_W4o,r knBcactlcFhu, algo, proto, unr,oll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadIn file included from s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPL tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hunused variable 'data1' [-Wunused-variable]:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tiIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tiddInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, | ^~~~~~~~~~~ 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, workIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid);( | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.ht:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | i if (tid < subdtn) RunW)orkColl,().reuin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heren 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, n(tiad, subtnd, threads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ worsk); | ^( /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1n:threads), tidInBloc note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group S/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hum_bf16_2, ncclFuncReduc:e, Func33Sum, h:ip_bfl7oat16:, NCCL _ALGO_note: RING,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here NCCL_PR OTO_SI MPLE, 2) | ^ 33/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611: | 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWo rkBatch , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hrims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->:670:r15: note: efield 'nthreads' will be initialized after field 'tidInBlock' d670 | O tipd(tiAd), rnthregads(,nthr eads0), ti,dIn Blockw(throeadIrdx.xk), g-rou>connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRinpg(gro(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, wIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), groupork->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ LL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadsIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS1i6, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connI nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < sndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, Nubtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DECFCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ INE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(:n670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]t 670 | h tird(tid)e, nthraeads(ndthreadss), ti), tidInBldInBloock(thcrk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmemSi.ze(stecpSize_o == mm.bu0f ? nccflShmemS.comm.ibuffSizes[NCCL_PROTO_zSes[NCICL_PROMTO_SIMPPLLE]/NCCL_STEPS/sizeEo]/NCCfL_STEP(S/sizeTof() : stepSiT) z: stepSeiz_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ e _) { | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33: 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, prims(tid, nthreads, &ring->prev, &ring->next, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex,w ork-w>sendobuff, rwork-k>recv-buff,> worck->redOpArg, 0o, wornk->connnIndIex, wnork-d>connIendex)x; ); | | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | run RirunRing(ttio, CdOLL_UN,ROLL> (tid,n nthrteadshreads, , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 2, 4>::run' requested here> 432( | ) if. (rtid u< sunbtn) R(unWotrkCoill ().wrun(tid, subton, wrork)k; | ); | ^ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | D 12E | DFEFINIE_ncclDevFNE_ncclDevFuncunc((ReduRce_RIeNG_SdIMPuce_LE_Sum_bf16_4,RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPL nccElFu,ncR edu4ce,) Fu ncS um,| hi^p_b flo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hat16, :NCC611L_A:LGO62_RI:NG, NCnote: CL_PROTO_SIexpanded from macro 'DEFINE_ncclDevFunc'MPL E , 4) 611 | ^ | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h :611: 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611R | u RunnWoWrkBoatcrh, , raedolpg, a,lg o, pprorto,o tunrooll,> unrol().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nt670:h15: rnote: field 'nthreads' will be initialized after field 'tidInBlock' e 670a | d tsid(), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),tid ), tnthirdeadIs(ntnhreBadls)o, ticdkInB(lockt(thhreardIdex.xa),d gIroup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 2_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groupIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SI(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo(ctid), ntkhreads((nthreadts), tihdInBlockr(threadIdex.x), graoup(grodIdx.x)up),, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | g rostup(group), | epSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->coev, n&ring->next, wornk->seIndbnuff, wodrk->erecvbufxf, work->connInd, worek->redxOpArg,) ; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | 63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hl:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->c/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' onnIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrol/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hl>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->reRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8cvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, _2, ncclFuncReduce, Fuunroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid,:670 nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | i:15:f (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 63 | ru 670 | tid(tid), nthnreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Ring(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ warnings generated when compiling for host. | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, 670 | nt tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hh:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, headIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18:, manti ssa; | ^ warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, maIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: ntiss/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.ha; | ^ :77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h::11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h7:75:7: warning: unused variable 'w' [-Wunused-variable] : 75 | barrier_warning: by_grouunused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:2915: note: expanded from macro 'barrier_by_group' 29 | | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ cons t int w = threadId x. const int w = threadIdx.x/Wx/WARP_SAIZE; \ | R ^ P_SI/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:In file included from 174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hb:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174a: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7r: warning: unused variable 'w' [-Wunused-variable]rier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 75 | barrier_by_group(); 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data111: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:, warning: unused variable 'data1' [-Wunused-variable] 145 | fuint32l_t dataa1, flgag1, d1ata2, ,fla g2; | d ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable]a 145 | t uinta32_t d2ata1, ,flag1 , datfa2, fllag2;ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:d warning: unused variable 'data2' [-Wunused-variable] a145 | tuint32_t dataa1, fl1, flag1, datag1, data2, flaga2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WA: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.hR:80:5: Pwarning: unused variable 'w' [-Wunused-variable] _80 | S barrIier_byZ_grouEp(); ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h: 29:15: \note: expanded from macro 'barrier_by_group' 29 | const| int ^w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARPIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11 ^~~~~~~~~~~~~~~~~~: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), t tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==SI ZE), | ~~~~~~~~~~~~~~~~~~0 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpI?nBlock(t ncclShmem.comm.buffSizhreeadIdx.x/sWARP_SI[ZE), | N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE C498 | fClagThreaLd((tid%_4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCPROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &riCLn_STEPS/gsizeof(-uint64>_t)) { p | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33e:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here v33 | , prim &ring->next, wos(rtid, knthre-ads, >&rings->preendbuff, work->recvbuff, work->redOpArgv,, &ri ng->n0ext,, work-> senwdbuffo, worrk->recvkbuff, work->redOpArg, 0, work->connIndex, work-->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn)>con nIndexR); | ^u /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hn:77:5:W note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here o77 | r runkRing(tid,, nth reads, Twork),; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Su COmLL_U_NROLL>f().ru1n(tid,6 subt_n, wo2rk); , | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp: 5:1: nnote: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5c | DEFINE_ncclcDevFulnc(ReFduce_RuING_LLn128_ScumReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_f1_6_2, PncclFRuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'OTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h 611 | : Ru670nWork:Batch15, sa)lgo,, p rotto, iunrodll>I().nrunB();l \ o | ^c k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | R/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x stepSi), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid unWzorkBaetch, ealgpo, pSroto,i unrozll>()e.run(_); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h=:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | =< subtn) RunWorkColl().run(tid, subtn, tid(work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g0 ? ncclShmem.comm.buffSizes[NCtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0,670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hh:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ alf, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2b: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11a: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hr:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:r75:7: iwarning: unused variable 'w' [-Wunused-variable] e75 | r b_arriber_byy_g_groroup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nst int w = threadIdx.x/WARP_SIZEIn file included from ; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from arrier/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp_:2: bIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: yIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174_: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:g7: warning: runused variable 'w' [-Wunused-variable] o75 | u pbarri(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: er_bexpanded from macro 'barrier_by_group'y_gr oup (); | 29 ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h | :29:15: note: expanded from macro 'barrier_by_group' 29 | c onstc into w =n thrseadItdx.x /WARiP_SInZE; t\ | ^ w = threadIdx.x/WAR P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145In file included from | ui/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: nt32_t data1, flagIn file included from 1/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: ,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174 : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145d:14:a warning: unused variable 'data1' [-Wunused-variable] t 145 | a 2uint3,2_t dataf1, fllag1a, datga2,2 flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t d;ata1, fl | ^~~~~a /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:g145:21:1 warning: unused variable 'flag1' [-Wunused-variable] , 145 data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | | u int32_t data1 , fl ag1u, daita2, nflagt2; 3| ^~~~~2 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, d_at datta1,a fla2g1, ,data2 , flfalag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:g2;35 : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3| ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h2:145:_35:t warning: unused variable 'flag2' [-Wunused-variable] 145 | uintd32_ta datta1, fala1, flag1, data2g1, data2, flag2; | ^~~~~ , flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h: 29 | 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | baIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), 11nthreads(nthreads), warningtisdI generatednB when compiling for logfx90a. ck(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpIn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hBlock:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, wo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid)k); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWo/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:10 warnings generated when compiling for host. 63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_tata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tidPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg%,4)==3), g roup(grou0p), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~, | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSizwe(ncclShomem.comm.rbuffSizes[kNCCL_PROT-O_LL12>connI8]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->ndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollne,xt, wo rk->seAndbuffl, workg->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRingd().runO(tid, psubtn,, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cppProtoLL128, COLL_UNROLL>(tid, nthreads, :7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NworCk); C| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hL:432:78_: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here P 432 | R iOf (tiTd < sOu_SIMPLE, 2btn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (Reduce_RING_LL128_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(ttid, hnthreads, worrke); a| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hd:432:s78: note: )in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here ,432 | t iif dI(ntid B< sulbtn)o RuncWorkkColl(threadIdx.x), group(grou().prun(Stiize(stepSize_ d, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_n== 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(ticcdlD,evFu nc(Renducet_RINhG_SIrMPLeE_Suam_f6d4_2,s, &ring->prev, &rin gnccl-Func>Redunce, FeuncSuxm, dtoubl,e, N CCL_wALGO_oRINGr, NkCCL_PROTO_S-IMPL>E, 2s) | e^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hn:611:62d: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ buff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[Block(thNreadIdx.xC), group(gCroup), | L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSiz_e(stepSPROiTzeO_SIMPLE]/NCCL_ =_= 0 ? nccSlShmemTE.cPS/omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connI33 | n pdrims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^e x); | ^/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63: | 63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | run Ring(tidn, nthreRaingds, , 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColloto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subt| ^n /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:,1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | wDEFINEo_ncclDrevFunc(Rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: educenote: _RING_in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereSIMPLE _S 12 | Dum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' EFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run611W | o RunWrorkBkatchB,< algoc, prooto, unrloll>l(),.run( ); t\ y, redop<| ^ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:y670:15:> , algo,note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(t prhotor, eaudnrollI>().drun(x); \. | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hx:670:15), group(g: rnote: field 'nthreads' will be initialized after field 'tidInBlock' o670 | u tpid(t)id),, nthreads(n thre ads)| , ti ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre anote: field 'group' will be initialized after field 'stepSize'd s670 | ( tnid(ttidh), rnthereadas(ndthresads)), ,tid InBtloick(dthreIadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | dOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:unused variable 'w' [-Wunused-variable]2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:757: | warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11In file included from warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cppIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hIn file included from :173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hwarning: initializer order does not match the declaration order [-Wreorder-ctor]:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h 670 | tid(tid),:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), nthr| eads ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497(nthr | eads), ti dInBlock(th readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(n/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, worNCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~In file included from | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuf/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from k->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(f, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndenthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZEx)); ,| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h :63 :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here| ~~~~~~~~~~~~~~~~~~63 | runRi ng(tihd, readnIthrdx.x/WARP_eaSds, wIork);ZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThr e| ^ a/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:d78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here( 432 | ( tif (tiid ([).ruNn(tiCd, sCubtnL, _PROTO_LL128]/NCCLw_ork)S; | T ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cppE:7:1P: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereS /sizeof(uint64_t7) | DEF)INE_ nccl{DevFu nc( R| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group educe_RING/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h_S:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here IM PLE_Sum_33u | prim32_s2, n(cclFtuncRid, nthreduece, Funads, &ring-cSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat>prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid), nthread,s, work ); | ^ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hidInBlock(threadIdx.x), group:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here( 432 | g if r(tid < osubtup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | n ) RunWor kColl().run(tid, subtn, workmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | ); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp: 12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here p 12 | DEFrINE_nccliDevFuncm(Reducse_RING_(SIMPLE_Sutm_u32_4i, ncclFduncReduc,e, Func Sum, uinnt3threads, &ring->prev, &r2_ti, NCCL_nALGO_RINgG, NCCL-_PRO>next, workTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: ->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthrexpanded from macro 'DEFINE_ncclDevFunc' e 611 | a dRunWsorkB,atch< collw, tyo, rerdop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if up),( | ^~~~~~~~~~~~~~~~~ t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:i60:d < subtn) RunWorkCollhrea(d).run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12Idx | .x)D, grEoup(gFroupI), N| ^~~~~~~~~~~ E_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:498::29670:: 15warning: : field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 496 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr),e atdisdI),nB lwoicdk((ttihdr%eaWdAIRdPx_.SxI),Z Eg)ro,u pwa(rgpr(otuipd)/,W A| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P_ S | I tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ZE ), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)671 | s t497e | p S i z ew(asrtpepISniBzleo_c k=(=t h0r e?a dnIcdcxl.Sxh/mWeAmR.Pc_oSmImZ.Eb)u,f f S| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e s| [ warp(tid/WARP_SIZEN CCL_PR O498TO | _ S I MfPlLEa]g/TNhCrCeLa_dS(T(EtPiSd/%s4i)z=e=o3f)(,T )g r:o uspt(egprSoiuzpe)_,) {| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 | group(group 499 | stepSiz/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.he:(33n:c7c:l Snote: hin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herem em.comm.buf f33S | i z e s [ N CpCrLi_mPsR(OtTiOd_,L Ln1t2h8reads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, F]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ uncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | : /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 670 | tid(tid), nthreads(nthr 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algoIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: , prIn file included from oto, u/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hnroll>(:).run(173); : \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:: note: field 'nthreads' will be initialized after field 'tidInBlock' 670670 | t:id(tid)15, nthre:ads(nth rwarning: einitializer order does not match the declaration order [-Wreorder-ctor] ad 670 | tis), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), ntd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncchlreadsS(nthrheads)m, teidInBlock(tmhread.Idx.xc), grooup(grmoup),m | . ^~~~~~~~~~~b uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg63 | , 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nt:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tidhreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ < subtn) RunWorkColl().ru/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFuedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nc(Reduce_RING_SIMPLE_SumPostDi/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), ntv_i32_2, ncclFuncReduhreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring-/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threa>prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_reads,S work)T; | ^ E/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78:P note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | S /if (tid < subtns) RunWorikColl(). r un| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing ncclFuncReduce, FuncSumPostDiv, int(32_t, tNCid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nc 432 | c ifl (tid S< subtn) hRunWorkmColl(m).run(tmid, su.btn, worbk); | ^u /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1f: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12f | DEFINE_SncclDiezes[NCCvFunc(Reduce_RING_SIML_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, PLE_SumPostDi&v_i32_r4, nccilFuncRneduce,g FuncSu-mPostD>iv, ipnt32_tr, NCCL_eALGvO_RING,, NCCL_ PROTO_SIMPLE&, r4) i| n^ g-/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h>next, work->sendbuff, wor:611:62:k note: expanded from macro 'DEFINE_ncclDevFunc' 611- | R>unWorkrBatch, algecvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing().ruOn(); L\ L| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h_:670:15: note: Ufield 'nthreads' will be initialized after field 'tidInBlock' 670N | Rtid(tiOd), nLthreadLs(n>threads(), tidtInBlocik(thdreadId,x. nthreadsx,), gro up(growup)o, rk); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h::670:60: note: 432field 'group' will be initialized after field 'stepSize' :67078/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRi: note: | tid(tid), nthreads(nthin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl()re.ads), rtidInBlock(threaduIdx.x),n group((groutp), | i ^~~~~~~~~~~ d, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RIng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NG_SIMPLE_SumPostDiv_i32_4,/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;In file included from \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, woIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlrk);ock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connI | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, Nndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, CCL_PROTOalgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groupIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmemROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SuIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_mPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn , work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7: 1670: | note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here tid(tid), 7n | tDhErFeIaNdEs_(nnctchlrDeeavdFsu)n,c (tRieddIuncBel_oRcIkN(Gt_hSrIeMaPdLIEd_xS.uxm)P,o sgtrDoiuvp_(ig8r_o2u,p )n,c c l| F ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n c| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_e duce, Func S671u | m P o s tsDtievp,S iiznet(8s_tte,p SNiCzCeL__ A=L=G O0_ R?I NnGc,c lNSChCmLe_mP.RcOoTmOm_.SbIuMfPfLSEi,z e2s)[ N C| C^L _PROTO/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h_:S611I:MP62L:E ]note: /expanded from macro 'DEFINE_ncclDevFunc'N CCL_STE P611S | / s i z eRoufn(WTo)r k:B asttcehp, algo, proto, unrol/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hl:>33(:)7.:r unote: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here() ; \ | ^ 33 | /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h : 670 : 15 : pnote: rfield 'nthreads' will be initialized after field 'tidInBlock'im s(tid, n670t | h r e a dtsi,d (&triidn)g,- >nptrherve,a d&sr(inntgh-r>enaedxst),, wtoirdkI-n>Bsleoncdkb(utfhfr,e awdoIrdkx-.>xr)e,c vgbruofufp,( gwroorukp-)>,r e d| O ^~~~~~~~~~~~~~~~~p Arg/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h,: 6700:,60 :w onote: rfield 'group' will be initialized after field 'stepSize'k ->connIn d670e | x , w otrikd-(>tciodn)n,I nndtehxr)e;a d s| ( ^n thread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hs:)63,: 5t:i dnote: Iin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested heren Block(t h63r | e a d I drxu.nxR)i,n gg(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const i= threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 1010 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp 11 warnings generated when compiling for gfx942. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, maIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ntissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t data1, /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cppf:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: lIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.ha:75:7: warning: unused variable 'w' [-Wunused-variable] g75 | 1barrier,_by_gro up(); | ^~~~~~~~~~~~~~~~~~d /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: anote: expanded from macro 'barrier_by_group' 29 | ta2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: : In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:17314: : warning: unused variable 'data1' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75: 1457: warning: unused variable 'w' [-Wunused-variable] 75 | barr | i uint32_et data1,r f_by_grouplag1(, data2),; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' flag2; | 29 ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145 | const :21: warning: unused variable 'flag1' [-Wunused-variable] i 145 | nt w = uintthreadI32_t data1, flag1, data2dx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h: 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(tIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tidIn file included from < subtn) RunWorkColl().run(tid, subtn/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp, wor:k2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h):; | ^ 11/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:: 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here In file included from 7 | DEF/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.hINE_nc:clDevF173unc(Re: duce_/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, Fu 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | nc SumPos tDiv , uint6s4_t, NtCCL_ALeGO_RINGp, NCSCL_PRiOTO_SIzMPe(stepSizLE, 2e) | ^_ == 0 ? nc /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' c611lShm | e Rm.communWo.rkBbauffSizes[NCCL_PROTO_SIMPLE]/NCCL_STtch, a/lgo, psroto, iunrollzeof>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid)(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->ne,x nthreatds(nt,hreads ), tidIwnBlock(othreardIdx.x)k, grou-p(grou>p), | ^~~~~~~~~~~~~~~~~sendbuff, work- /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670>:60: note: rfield 'group' will be initialized after field 'stepSize' 670 | ecvbuff, work->red tOpArg, 0, work->cid(toid), nnthrneadsI(nthnreadsd), teidInxBloc,k(t hwork->connIndex); readIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg,next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFu/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hn:670:c15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ( 670 | R tid(etiduce_RING_SIMPLE_SumPostDid), nthreads(nthreads), tidInBlock(threadIv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) dx.x)| , gro^up(gr oup),/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :671 | 611 step:Size(62stepS:ize_ == 0note: ? nexpanded from macro 'DEFINE_ncclDevFunc'cclSh mem.c omm.buffS611izes[ | NCCL RunWorkBatch, alg_PoROTO_S,IMPLE ]/NCpCL_STrEPS/soizeof(tT) : ostepS,ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h : | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthrentahredadss, &)rin,g-> prtevidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt,h &rringe->neaxt,d wosrk->(senndbutffh, wrorke->raecvdbufsf, )wor,k-> redtOpAirg, d0, Iwornk->connBIndlex,o wocrk-k>co(nnItndehx);r | e ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.ha:63d:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here I63 | d x.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, wodOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIrMk->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTIn file included from O/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from _/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: :initializer order does not match the declaration order [-Wreorder-ctor] 611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 670 | 611 ti | RunWorkBatd(ticd), nthhreads<(coll, nthreads),ty, redop, algo, proto, unroll>().run( tidI)nBlock(;thre adIdx.x\), gro up(g roup)| , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ t671 | istepSidze(st(tid), nthreads(nthreaepSize_ == 0 ? ndcclShmsem), tidInBlock(threa.comm.buffSizdes[NCCLI_PdRx.x), group(group), OTO_| SIMPL ^~~~~~~~~~~~~~~~~E]/NCC L_STEP/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hS/s:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | i zeof( T) : s t tid(tid), nthreadsepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:(nthreads), tidInBlock(threadIdx.x), group(gro33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62:670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work);In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | i f (ti d < subttn) RiunWorkdColl(n).run(Btid, sulbtn, woork); c | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cppk:12:(1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here t 12 | DhreEFIadIdx.x), grouNEp_ncclDe(vFunc(gReroup),duce_RING_SIMPLE_SumPostDiv_u8_4, ncclFun | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_cR)educe, FuncS{umPostDi v, uin t8_t, | NCCL_A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~LGO_R ING, N CCL_PR| OTO_SI group(groupMPLE, 4) /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h| ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWork:Ba33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | tch, alg o, proto, unr&roll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ing->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.hti:d(tid63), nth:reads(5nt: hnote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested herer ead s), tidIn63Block(threadI | dx.x), group (group ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:r60: note: ufield 'group' will be initialized after field 'stepSize'nRing(tid, nthrereads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(ti d, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:498:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 496 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 497 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 498 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 499 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(n:670:15t: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670h | tid(trid), nthreeads(nthraeads), tdidInBlock(st), tidInBlochrekadIdx.x), (grthreadIdx.x), gorup(group)o, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | up(group) tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSiz e(ste stepSizpSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEe(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ringPS/s-izeof(T) >: stepSipze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.he:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here v33 | p,rims(tid, nthreads,& &ring-r>prev, &riing->next,n wg->next, work->soendbufrk->sfe, wornk->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ de/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.hx); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here: 63 | 432 runR:ingin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here(tid, nthrea ds, work); | ^432 /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432 | :78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl( if (tid < subtn) RunWorkColl().run(tid, subtn, work)).;run( tid, sub| tn, ^work ); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | :DE12FI:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_nNcE_nccclDelvFunDc(Reeducve_RFING_unc(Reduce_RINGSIM_PLSE_SumIPostMDiv_PuLE_SumP8_4o, ncsclFutncRDiv_u8_4, nccedluceF, FunucSumnPcostDiv, uint8_t, NCCL_ALGO_RReduIce,N FuGncS,umP oNstDCiv,CL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | : 611:62 : note: expanded from macro 'DEFINE_ncclDevFunc' 611 | R uRunnWorWkBaotchr,< coll, ty, redop, algoa,lgo , pprotro, uonrotll>o().,run ();u \ n | ^ r/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.holl>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre anote: field 'group' will be initialized after field 'stepSize'd s670 | ( ntidt(tihd), rnthreeadas(ndthrsead)s),, ti dtInBilocdk(tIhrenadBIldxo.x)c, gkrou(p(gthreadroIupd),x.x ) | ^~~~~~~~~~~, group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connInde/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.hx:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for host. 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60402 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/git_version.cpp In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.hIn file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:257:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 257 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:259:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 259 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:271:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 271 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:269:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 269 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:259:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 259 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:271:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 271 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, nccl/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] FuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h generated:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ when compiling for host. /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1101. [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/hipcc -fPIC -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -parallel-jobs=1 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -Xlinker --dependency-file=CMakeFiles/rccl.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/rccl_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/register.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] Elapsed time (seconds): 8714.18 f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.4.43484 --hip-link --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' [100%] Built target rccl gmake[1]: Leaving directory '/builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.M7sErW + umask 022 + cd /builddir/build/BUILD/rccl-6.4.2-build + '[' /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT '!=' / ']' + rm -rf /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT ++ dirname /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT + mkdir -p /builddir/build/BUILD/rccl-6.4.2-build + mkdir /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.4.2 + DESTDIR=/builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "RelWithDebInfo" -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/librccl.so.1.0 -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/librccl.so.1 -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/librccl.so -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/include/rccl/rccl.h -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/include/rccl/nccl_net.h -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/include/rccl/amd_detail/api_trace.h -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple-op.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple_2.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-0-9kb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-190kb-512kb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-512kb-7mb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-7mb-43mb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-9kb-190kb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets-relwithdebinfo.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + echo s@/builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT@@ + find /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64 -name '*.so.*.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64 -name '*.so.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64 -name '*.so' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/lib64 -name '*.cmake' + sed -f br.sed + '[' -f /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt ']' + rm /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + /usr/bin/find-debuginfo -j2 --strict-build-id -m -i --build-id-seed 6.4.2-3.fc43 --unique-debug-suffix -6.4.2-3.fc43.x86_64 --unique-debug-src-base rccl-6.4.2-3.fc43.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2 find-debuginfo: starting Extracting debug info from 1 files DWARF-compressing 1 files dwz: ./usr/lib64/librccl.so.1.0-6.4.2-3.fc43.x86_64.debug: Unknown debugging section .debug_str_offsets sepdebugcrcfix: Updated 0 CRC32s, 1 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/rccl-6.4.2-3.fc43.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + /usr/lib/rpm/redhat/brp-python-rpm-in-distinfo + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j2 + /usr/lib/rpm/redhat/brp-python-hardlink + /usr/bin/add-determinism --brp -j2 /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT Scanned 38 directories and 316 files, processed 0 inodes, 0 modified (0 replaced + 0 rewritten), 0 unsupported format, 0 errors Reading /builddir/build/BUILD/rccl-6.4.2-build/SPECPARTS/rpm-debuginfo.specpart Processing files: rccl-6.4.2-3.fc43.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.eJxos4 + umask 022 + cd /builddir/build/BUILD/rccl-6.4.2-build + cd rccl-rocm-6.4.2 + LICENSEDIR=/builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/licenses/rccl + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/licenses/rccl + cp -pr /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/LICENSE.txt /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/licenses/rccl + RPM_EC=0 ++ jobs -p + exit 0 Provides: librccl.so.1()(64bit) rccl = 6.4.2-3.fc43 rccl(x86-64) = 6.4.2-3.fc43 Requires(interp): /sbin/ldconfig /sbin/ldconfig Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires(post): /sbin/ldconfig Requires(postun): /sbin/ldconfig Requires: ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_4.3)(64bit) libamdhip64.so.6(hip_4.5)(64bit) libamdhip64.so.6(hip_5.0)(64bit) libamdhip64.so.6(hip_5.3)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.16)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.3)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.42)(64bit) libc.so.6(GLIBC_2.6)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_12.0.0)(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) librocm_smi64.so.1()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Processing files: rccl-devel-6.4.2-3.fc43.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.CiSVqz + umask 022 + cd /builddir/build/BUILD/rccl-6.4.2-build + cd rccl-rocm-6.4.2 + DOCDIR=/builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/doc/rccl-devel + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/doc/rccl-devel + cp -pr /builddir/build/BUILD/rccl-6.4.2-build/rccl-rocm-6.4.2/README.md /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT/usr/share/doc/rccl-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(rccl) = 2.22.3 rccl-devel = 6.4.2-3.fc43 rccl-devel(x86-64) = 6.4.2-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: cmake-filesystem(x86-64) librccl.so.1()(64bit) Processing files: rccl-data-6.4.2-3.fc43.noarch Provides: rccl-data = 6.4.2-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debugsource-6.4.2-3.fc43.x86_64 Provides: rccl-debugsource = 6.4.2-3.fc43 rccl-debugsource(x86-64) = 6.4.2-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debuginfo-6.4.2-3.fc43.x86_64 Provides: debuginfo(build-id) = 09cc96ffe5aead4baf0cbfa4d16fcbbdef39243d librccl.so.1.0-6.4.2-3.fc43.x86_64.debug()(64bit) rccl-debuginfo = 6.4.2-3.fc43 rccl-debuginfo(x86-64) = 6.4.2-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: rccl-debugsource(x86-64) = 6.4.2-3.fc43 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILD/rccl-6.4.2-build/BUILDROOT Wrote: /builddir/build/RPMS/rccl-data-6.4.2-3.fc43.noarch.rpm Wrote: /builddir/build/RPMS/rccl-debuginfo-6.4.2-3.fc43.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-debugsource-6.4.2-3.fc43.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-devel-6.4.2-3.fc43.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-6.4.2-3.fc43.x86_64.rpm Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.9J4D6P + umask 022 + cd /builddir/build/BUILD/rccl-6.4.2-build + test -d /builddir/build/BUILD/rccl-6.4.2-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/rccl-6.4.2-build + rm -rf /builddir/build/BUILD/rccl-6.4.2-build + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild rccl-6.4.2-3.fc43.src.rpm Finish: build phase for rccl-6.4.2-3.fc43.src.rpm INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1754764510.102582/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/rccl-6.4.2-3.fc43.src.rpm) Config(child) 253 minutes 26 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "rccl-debugsource", "epoch": null, "version": "6.4.2", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl-debuginfo", "epoch": null, "version": "6.4.2", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl-devel", "epoch": null, "version": "6.4.2", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.4.2", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.4.2", "release": "3.fc43", "arch": "src" }, { "name": "rccl-data", "epoch": null, "version": "6.4.2", "release": "3.fc43", "arch": "noarch" } ] } RPMResults finished